Methods of Interpreting Diagnostic Data 
for Locating Faults in Digital Machines 


By H. Y. CHANG and W. THOMIS 


(Manuscript received August 29, 1966) 


Several techniques for translating the results of diagnostic tests into 
specific fault identities are described. This translation can be difficult in 
large and complex machines. The amount of test data required to isolate 
faults, and the obscure symptoms some faults generate, preclude efficient 
manual test-by-test interpretations. 

The additional observed fact that a significant number of faults yield 
inconsistent test results from diagnosis to diagnosis demands a flexible 
interpretation of data. Techniques are described for producing fault 
dictionaries which can be used by the maintenance craftsman to identify 
machine faults in a relatively short time. These techniques utilize 
multidimensional geometric representations of diagnostic results, methods 
for identifying and ignoring inconsistent tests, pseudo-random mappings, 
and other procedures for condensing and organizing the information 
contained in diagnostic test data. 

The results of applying these techniques to data obtained from the Bell 
System’s No. 1 Electronic Switching System are also discussed. 


I. INTRODUCTION 


The problem of locating faults in digital systems is becoming more 
acute with the increased complexity of these machines and their 
expanding use in real-time applications. The need for techniques 
for locating faults by automatic programmed diagnostic 
tests has been realized for some time and is being actively pursued in 
many areas.’:? This need has particular urgency for a system designed 
to provide uninterrupted service. Such systems require extensive sub- 
system duplication as well as facilities for rapid fault isolation, loca- 
tion, and repair. The Bell System’s No. 1 Electronic Switching System 
(No. 1 ESS), designed to control telephone switching functions, is an 
example.? 

When machines were smaller and simpler, even the use of diagnostic 
tests to provide relevant trouble location symptoms was rare. An 
expert could usually locate the trouble by a quick survey of the be- 
havior of the machine. This may be described as the “eureka” ap- 
proach. 

With large complex machines, however, analysis of symptoms by 
mere observation is lengthy and costly. Strange behavior sometimes 
occurs which even the expert is hard put to explain. Even the addition 
of test points and special diagnostic tools such as programmed testing 
may not immediately clarify the situation. Further, the sheer quantity 
of test data necessary to isolate a trouble to one of the myriad of 
components comprising the machine often demands preprocessing 
before presentation to the maintenance craftsman. 

The techniques described in this paper evolved during the develop- 
ment of No. 1 ESS. They were devised in response to the need for 
translating the output of diagnostic programs into specific fault identi- 
ties. Such translation techniques had to use a minimum of real time 
and memory while being accurate and rapidly applicable by rela- 
tively unskilled personnel. Our results were obtained by applying these 
techniques to the data generated on the No. 1 ESS. This system and 
particularly its central processor is sufficiently like other digital ma- 
chines that we believe our conclusions have some general validity. For 
readers who are not familiar with No. 1 ESS, a brief description of its 
functions and maintenance plan is provided in Appendix A. 

Section I reviews several conventional diagnostic data interpreta- 
tion techniques—the so-called exact-match presentations, and points 
out some of their shortcomings. Section II reports some general facts 
concerning the inconsistency of test results and suggests a number of 
solutions to the problem. These solutions fall into two general cate- 
gories: the phase dictionary approach and the cell dictionary approach. 
Section III describes the phase dictionary approach, its implementa- 
tion in No. 1 ESS, its advantages and disadvantages, and also discusses 
some alternate techniques. Section IV introduces the cell dictionary 
approach and describes how it was implemented on No. 1 ESS. Evaluation 
results, obtained by applying these techniques to data obtained 
from No. 1 ESS, are discussed in Section V. 


1.1 Terms and General Background 


A diagnostic test consists of the application of special inputs to a 
machine for the purpose of locating a possible fault. The correspond- 
ing responses are termed the test results or diagnostic data. A fault can 
be defined as a physical defect in a logical element which will cause 
incorrect machine operations. Test results are generally processed and 
interpreted, either automatically by programs or by other manual 
means, to give field maintenance personnel the necessary information 
to locate and identify the faulty circuit component or packages. A 
circuit package is the smallest replaceable module of a machine. In 
No. 1 ESS, it consists of a relatively small number of components 
mounted on a printed wiring board.® 

There are at least two types of programmed procedures for fault 
diagnosis: the “combinational” approach and the “sequential” approach. 
In the combinational approach a fixed set of tests is applied to the 
machine, and the results analyzed to identify the fault. The identifica- 
tion process generally utilizes a fault dictionary which is a listing of 
the test results of known faults organized in a fashion convenient for 
look-up.? In the sequential approach, the set of tests applied to the ma- 
chine is not fixed.* The result of each test is used as a basis for deter- 
mining the next test to be applied. Each fault, or a group of faults 
giving identical diagnostic results, is then identified by a certain test 
sequence—no additional data analysis or dictionary look-up is neces- 
sary. It is noted that this distinction is to some extent academic. A 
sequential analysis can be performed on data generated by the com- 
binational approach. In cases where faults cause large numbers of 
tests to give inconsistent results, this may be advantageous since the 
sequential approach will be costly in terms of memory storage. Only 
if the average running time of the sequential approach is significantly 
less than the combinational approach and time is at a premium will 
the sequential approach be a better choice. For these reasons, the com- 
binational approach is used in No. 1 ESS. 

The fault diagnostic information necessary for generating diction- 
aries can be obtained by two fundamentally different procedures. The 
two approaches are known as “program simulation” and “hardware 
simulation.” In program simulation the logic description of a machine 
is compiled into a computer program which is designed to simulate 
the behavior of the object machine. A particular fault can then be 
“introduced” into the object machine simply by appropriately changing 
the program description of the machine’s logic. Subsequent logical 
simulation of the machine under control of its diagnostic program re- 
veals the object machine’s actions under test in the presence of the 
fault. In hardware simulation, faults are physically introduced into a 
real machine by replacing good circuit components with catastrophic 
failures such as shorts or opens. The diagnostic tests are performed 
and results recorded each time a fault is inserted. A fault dictionary 
can then be generated by processing test results obtained by either 
method. 


1.2 Straightforward Dictionary Presentations 


One method of presenting diagnostic data for dictionary use is 
simply to list for each fault only the failing tests. This method is quite 
efficient when, on the average, few diagnostic tests fail.5 A sample page 
(with added comments) of such a listing appears as Fig. 1 (a). 

In this example, the tests were grouped into so-called test phases 
such as phase A, B,...G, H,... etc. The tests in each phase are 
numbered sequentially. Ordering of entries in the dictionary is first by 
phases in alphabetical order, then by test numbers within the phase. 
This type of dictionary was employed in the first Electronic Central 
Office which was in commercial use at Morris, Illinois between 1960 
and 1962. A detailed description of its format and implementation is 
contained in Ref. 5. 

As can be seen, such a technique enables a relatively unskilled 
maintenance man to trouble-shoot by merely searching for matches of 
any particular pattern with entries in the dictionary. Furthermore, in 
this representation, the exact configuration of test results is preserved. 
This feature may be useful when dictionary look-up fails to locate the 
trouble. The maintenance man may be able to locate faults by a direct 
examination of the test pattern with the aid of other documents such 
as diagnostic program listings, logic flow charts, etc. 

However, this type of presentation suffers from the disadvantage of 
bulk when the diagnosis is of any considerable size (i.e., more than 
about 1000 tests). Further, large numbers of diagnostic tests result 
in large and complicated patterns, which in turn increase the diffi- 
culty of finding matches with dictionary entries [see Fig. 1 (b)]. This 
technique is one of a class of methods called exact-match techniques 
since an exact match between a test pattern generated by a real fault 
and a pattern in the dictionary is required to identify the trouble. 
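
The exact-match idea can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the fault labels, the test numbers, and the function names build_exact_match_dictionary and look_up are hypothetical and are not taken from the Morris or No. 1 ESS dictionaries. Each fault is stored under its complete set of failing tests, and a field pattern identifies a fault only if it equals a stored pattern exactly.

```python
from typing import Dict, FrozenSet, List, Tuple

# A failing test is identified here by (phase letter, test number).
FailingTest = Tuple[str, int]
Pattern = FrozenSet[FailingTest]


def build_exact_match_dictionary(simulated: Dict[str, Pattern]) -> Dict[Pattern, List[str]]:
    """Group fault identities by their complete set of failing tests."""
    dictionary: Dict[Pattern, List[str]] = {}
    for fault, pattern in simulated.items():
        dictionary.setdefault(pattern, []).append(fault)
    return dictionary


def look_up(dictionary: Dict[Pattern, List[str]], observed: Pattern) -> List[str]:
    """Return the faults whose stored pattern equals the observed one exactly."""
    return dictionary.get(observed, [])


# Hypothetical simulation results (labels and test numbers are illustrative).
simulated_faults = {
    "package X, fault 1": frozenset({("G", 166), ("G", 167), ("G", 170), ("H", 61)}),
    "package Y, fault 1": frozenset({("G", 166), ("G", 167), ("G", 170), ("H", 76)}),
    "package Y, fault 2": frozenset({("G", 166), ("G", 167), ("G", 170), ("H", 76)}),
}
exact_dictionary = build_exact_match_dictionary(simulated_faults)

consistent = frozenset({("G", 166), ("G", 167), ("G", 170), ("H", 76)})
inconsistent = frozenset({("G", 166), ("G", 170), ("H", 76)})  # one test differs
```

In this sketch the consistent pattern retrieves both faults of package Y, while the pattern that differs in a single test retrieves nothing, which is the weakness of exact-match techniques taken up later in the paper.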


[Figure 1: (a) short dictionary entries, each an ordered list of failing tests by phase (for example PHASE G 166 167 170, PHASE H PARTIAL 061, PHASE H 076) followed by the circuit package location and type plus fault identity; (b) an entry whose pattern contains long lists of failing tests in PHASE A and PHASE B.] 

Fig. 1— Sample page of fault dictionary. 



1.3 An Improved Exact-Match Technique 

In order to reduce the problem of dictionary bulkiness and the 
difficulty of manual look-up of large and complicated fault patterns, a 
“number generation” technique is used in No. 1 ESS. This technique 
is essentially a “hash” storage technique which effectively “reduces” 
each diagnostic to a smaller fixed-length decimal number by means of 
a pseudo-random mapping.® Conceptually, the probability of mapping 
two or more large binary numbers into the same decimal number of 
relatively small size can be made arbitrarily small by a proper selec- 
tion of the decimal sample space. A detailed discussion on the particu- 
lar technique used in No. 1 ESS is incorporated as Appendix B. 
One interesting result is that if the mapping is truly random, the 
expected number of diagnostics that will map into the same random 


number is 
1) 
B=k-N(i-(1-4 ,; 


where N is the number of diagnostic test results to be mapped and k 
denotes the smallest integer greater than or equal to log10 N, i.e., the 
number of digits required in the decimal number. Thus, for No. 1 ESS, 
in which the largest sample of fault patterns for any unit is no greater 
than 10^5, a 12-digit representation will suffice to ensure that the number 
of replications, as a result of the data reduction mapping, will be less 
than one (if, of course, the mapping is truly random). 

Fig. 2 shows a sample of No. 1 ESS exact-match dictionary entries. 
Each of the 12-digit numbers was derived from a fault pattern by a 
pseudo-random manipulation of the pattern. As described in Appendix 
B, the results of this pseudo-random mapping method agreed very well 
with the theory. 

It is estimated that a five-to-one reduction in bulk was thus achieved. 
In addition, it was found that the time required for look-ups was 
greatly reduced. 
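
As a rough illustration of number generation, the following Python sketch reduces a pass/fail bit string to a 12-digit decimal number and estimates how many replications a truly random mapping would be expected to produce. Two assumptions should be kept in mind: SHA-256 is only a stand-in for the pseudo-random mapping of Appendix B, and expected_replications uses the classical occupancy result for n items thrown uniformly into 10^d bins, which is not necessarily the exact expression used in the text. The function names are hypothetical.

```python
import hashlib
import math


def dictionary_number(pattern_bits: str, digits: int = 12) -> str:
    """Reduce a pass/fail bit string to a fixed-length decimal number.

    SHA-256 is an assumed stand-in for the paper's pseudo-random mapping.
    """
    digest = hashlib.sha256(pattern_bits.encode("ascii")).digest()
    value = int.from_bytes(digest, "big") % 10**digits
    return f"{value:0{digits}d}"


def expected_replications(n_patterns: int, digits: int) -> float:
    """Expected number of patterns that land on an already used number.

    Occupancy result: n - m * (1 - (1 - 1/m)**n) with m = 10**digits,
    evaluated with log1p/expm1 to avoid loss of precision for large m.
    """
    m = 10.0**digits
    expected_occupied = -m * math.expm1(n_patterns * math.log1p(-1.0 / m))
    return n_patterns - expected_occupied


number = dictionary_number("0110100010")
replications_12_digits = expected_replications(100_000, 12)
replications_5_digits = expected_replications(100_000, 5)
```

Under these assumptions, 10^5 patterns mapped into 12 digits give an expected replication count well below one, whereas a 5-digit number would produce tens of thousands of replications.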


[Figure 2: sample dictionary page. Each entry consists of a 12-digit number, printed in four-digit groups, followed by the equipment location and circuit.] 

Fig. 2— Sample format of No. 1 ESS dictionary entries. 

1.4 Shortcomings of “Exact Match” Techniques 

A major problem in using such approaches is that occasionally a 
diagnostic generated in the field does not exactly match a dictionary 
entry. This will arise if the results of the field diagnosis of a given 
fault differ from those yielded by the diagnosis of the same fault which 
was used to prepare the dictionary, i.e., the test results are inconsistent. 
There are many possible causes of inconsistent test results. Some of the 
more obvious ones are (i) improper machine initialization, i.e., if the 
effect of the fault is such that it prevents the machine under diagnosis 
from being properly initialized, the test results may then vary from 
time to time, depending on the state of memory elements at the time 
the fault occurs, (ii) presence of intermittent or marginal faults, i.e., 
faults that cause machine malfunction at some times but not at others, 
and (iii) other factors such as the presence of noise and/or variations 
in circuit component values, etc. Consequently, supplementary 
techniques are desirable. 


II. GENERAL FACTS CONCERNING INCONSISTENT TEST RESULTS 


Suppose one had the fault patterns for a number of sample faults 
that were simulated on a test model of a digital machine for purposes 
of producing a fault dictionary. Further, suppose that these same 
faults were introduced into a different but supposedly identical ma- 
chine and the fault patterns collected. A comparison of the two sets of 
data is revealing. 

Two such sets of data were collected using No. 1 ESS. A sample of 
faults was inserted in the central processing unit of a No. 1 ESS office 
in Chase, Maryland and compared with dictionary results obtained 
from another No. 1 ESS at the laboratory. The sample consisted of 302 
faults selected so as to be both well distributed and representative of 
expected troubles. Of the 302 faults inserted, 58 produced printouts 
that could not be found in the exact-match dictionary. For various 
reasons, only 40 of these 58 inconsistent printouts could be analyzed.* 
This experiment and a comparison of the test results of these 40 faults 
with their corresponding dictionary patterns show that: 


(i) About 15 to 20 percent of the data for corresponding faults 
disagree. 

(ii) Among the diagnostics that are inconsistent, the majority differ 
in only a few bits. 


* This experiment is reported fully in Section 5.1. 
(iii) Furthermore, these differences generally cluster, i.e., a 
relatively small group of adjacent test bits is affected. Several groups 
may be affected but in general, the groups do not consist of more than 
about 25 tests (about 4 percent of the overall diagnostic). 

(iv) Within these clusters the diagnostics produced by the test 
model tend to have somewhat fewer failing tests. 

(v) Only about 25 percent of the inconsistent diagnostics have ex- 
tensive differences. 

These observations are qualitative in nature and can only claim to 
be representative of No. 1 ESS. However, they serve to suggest a 
number of ways to attack the problem. 

Observation (iii) above suggests that two differing patterns for the 
same fault could be made to match if the cluster of differing test bits 
were masked out. This idea resulted in the Phase Dictionary, the 
Phase Prime Dictionary, and the Test Group Dictionary (see 
Section III). 

Observations (ii) and (iv) suggest that the two differing patterns 
are related and that their relationship could be expressed perhaps by 
some function of their Hamming distance. This idea resulted in various 
forms of ‘Cell’ Dictionaries (see Section IV). We shall consider first 
the Phase Dictionary and some of its relatives. 


III. PHASE DICTIONARY APPROACH 


3.1 Characteristics of Diagnostic Data 


The usual technique used to design diagnostic tests for a large and 
complicated machine is to divide the machine functionally into many 
small and disjoint (if possible) logic blocks. A logic block can be taken 
as a group of functionally related circuits, such as an order decoder 
or an index register, etc., whose input-output terminals are readily 
accessible. The tests are then designed to pinpoint faults which may 
exist in each logic block, assuming other logic blocks in the machine 
are faultless. The overall diagnostic is therefore composed of a con- 
catenation of test results of many so-called test phases, each of which 
consists of tests that are aimed at testing a particular logic block. 

Normally, when a fault occurs, it is expected that the fault will be 
detected by many of the tests that are specially designed to test that 
part of the circuitry where the fault lies. Thus, one would expect that 
in each overall diagnostic, the test failures would be roughly dis- 
tributed over a certain number of test phases, rather than over all test 
phases. This is, indeed, the case as one examines, for example, the 
diagnostic data of one unit of the No. 1 ESS central processor complex, 
the Central Control? (see Fig. 3). The Central Control has a total of 
28 test phases; the overall diagnostic is about 5000 bits long. Out of 
102,518 faults simulated, over 97 percent of them yield diagnostics 
which indicate that some tests failed in only four or fewer test phases. 


3.2 General Description of Method 

The essential idea of the Phase Dictionary approach lies in the 
notion of identifying “phase diagnostics,” i.e., the failure patterns of 
individual test phases. That is, in addition to the normal “exact- 
match” dictionary (as it is described in Section 1.3) one further pre- 
pares a supplementary Phase Dictionary which is divided into many 
chapters, each of which is produced by processing each diagnostic 
phase as if it were the whole diagnostic. Then a fault would be re- 
dundantly identified by a number of dictionary entries: one in the 
exact-match dictionary and many in the phase dictionary, as many as 
the number of test phases that failed in its overall diagnostic. For 
example, a fault, f_i, which has failed some tests in test phases 2, 3, 4, 
and 6 would be identified by an entry in the exact-match dictionary 
and four additional entries in the phase dictionary, one each in 
chapters designating test phases 2, 3, 4, and 6. 


[Figure 3: distribution of the number of fault diagnostics, R, with P test phases failed, plotted against P. Note: total number of test phases = 28; total number of faults simulated = 102,518.] 

Fig. 3— No. 1 ESS central control diagnostics. 

Now suppose f_i is diagnosed in the field and its failure pattern 
obtained. Five numbers will then be generated: a normal exact-match 
dictionary number, which is generated by the overall pattern of the 
diagnostic, and four phase numbers, each of which results from 
manipulating the diagnostic of one of test phases 2, 3, 4, and 6. The 
maintenance man normally will first try to find the overall number in the 
exact-match dictionary. If there is a match, he can simply replace the 
circuit package(s) indicated by the dictionary entry. If, on the other hand, 
he cannot find a match, he will then consult the phase dictionary 
by matching phase numbers with dictionary entries in chapters 
2, 3, 4, and 6. Assume the inconsistency in the diagnostic is small and 
is confined to, for example, test phase 3 only. The maintenance man 
will probably discover that he can successfully match phase numbers 
for test phases 2, 4, and 6 with some entries in chapters 2, 4, and 6. 
He will not, however, find a match in chapter 3 because the field 
diagnostic of f_i which generated the phase number for test phase 3 differs 
from the diagnostic which was used to generate the dictionary entry. 
Nonetheless, the maintenance man can still examine the circuit 
package(s) indicated by the phase dictionary entries of test phases 2, 4, 
and 6, where he finds a match, to determine which package(s) is to be 
replaced. Since he knows that the circuit package(s) associated with 
fault f_i ideally would be listed under all of these entries, he should 
select the package(s) that has appeared the greatest number 
of times. This majority rule approach not only leads one toward 
replacing the proper circuit package(s) but also reduces the number of 
unnecessary package replacements. 
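
The look-up procedure and the majority rule described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions: phase patterns are kept as plain tuples of failing test numbers rather than as phase numbers, and the package names, test numbers, and function names are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Tuple

PhasePattern = Tuple[int, ...]        # failing test numbers within one phase
Diagnostic = Dict[int, PhasePattern]  # test phase -> its failing tests


def build_phase_dictionary(simulated: Dict[str, Diagnostic]) -> Dict[int, Dict[PhasePattern, List[str]]]:
    """One chapter per test phase; each chapter maps a phase pattern to packages."""
    chapters: Dict[int, Dict[PhasePattern, List[str]]] = {}
    for package, diagnostic in simulated.items():
        for phase, failing in diagnostic.items():
            chapters.setdefault(phase, {}).setdefault(failing, []).append(package)
    return chapters


def majority_rule(chapters, observed: Diagnostic) -> List[Tuple[str, int]]:
    """Count, for each package, the number of chapters in which it was matched."""
    votes: Counter = Counter()
    for phase, failing in observed.items():
        for package in chapters.get(phase, {}).get(failing, []):
            votes[package] += 1
    return votes.most_common()


# Hypothetical dictionary data: package P fails phases 2, 3, 4, and 6.
simulated = {
    "package P": {2: (1, 4), 3: (7, 8, 9), 4: (2,), 6: (5, 6)},
    "package Q": {2: (1, 4), 5: (3,)},
}
chapters = build_phase_dictionary(simulated)

# Field diagnostic of the same fault, inconsistent in phase 3 only.
field = {2: (1, 4), 3: (7, 9), 4: (2,), 6: (5, 6)}
ranking = majority_rule(chapters, field)
```

In this example chapter 3 yields no match, package P is matched in chapters 2, 4, and 6, and package Q is matched only in chapter 2, so package P is selected.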

A slightly more sophisticated form of the phase dictionary can 
eliminate the need for the somewhat laborious majority rule approach. 
The phase dictionary consists of entries formed by masking out all 
failing test bits except those in a single phase. The Phase Prime Dic- 
tionary consists of entries formed by masking out only a single phase 
at a time. Thus, if tests in phases 3, 4, and 5 failed during diagnosis, 
a number for phases 3 and 4, one for 3 and 5, and one for 4 and 5 
would be produced. Then if the inconsistencies fall only in, say, phase 3, 
only the number produced for phases 4 and 5 could be matched. Note 
that the majority rule is not needed here as the dictionary for phases 
4 and 5 has already performed that function automatically. 

The phase prime dictionary, of course, is a little less general than 
the phase dictionary in that it is useful only if inconsistencies are 
confined to a single phase. 
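
A minimal Python sketch of the phase prime idea follows, under the assumption that each key is simply the diagnostic with one failing phase removed (no number generation is applied). The names phase_prime_keys, build_phase_prime_dictionary, and look_up, and all data values, are hypothetical.

```python
from typing import Dict, List, Tuple

PhasePattern = Tuple[int, ...]
Diagnostic = Dict[int, PhasePattern]
Key = Tuple[Tuple[int, PhasePattern], ...]


def phase_prime_keys(diagnostic: Diagnostic) -> Dict[int, Key]:
    """For each failing phase, the pattern that remains when it is masked out."""
    keys: Dict[int, Key] = {}
    for masked in diagnostic:
        remaining = tuple(sorted((p, t) for p, t in diagnostic.items() if p != masked))
        if remaining:  # a single-phase diagnostic leaves nothing to match on
            keys[masked] = remaining
    return keys


def build_phase_prime_dictionary(simulated: Dict[str, Diagnostic]) -> Dict[Key, List[str]]:
    dictionary: Dict[Key, List[str]] = {}
    for package, diagnostic in simulated.items():
        for key in phase_prime_keys(diagnostic).values():
            dictionary.setdefault(key, []).append(package)
    return dictionary


def look_up(dictionary: Dict[Key, List[str]], observed: Diagnostic) -> Dict[int, List[str]]:
    """Return {masked phase: packages} for every masked pattern that matches."""
    hits: Dict[int, List[str]] = {}
    for masked, key in phase_prime_keys(observed).items():
        if key in dictionary:
            hits[masked] = dictionary[key]
    return hits


# Hypothetical fault failing phases 3, 4, and 5.
simulated = {"package P": {3: (7, 8, 9), 4: (2,), 5: (1, 6)}}
prime_dictionary = build_phase_prime_dictionary(simulated)

field = {3: (7, 9), 4: (2,), 5: (1, 6)}  # inconsistency confined to phase 3
matches = look_up(prime_dictionary, field)
```

Only the key formed from phases 4 and 5 (phase 3 masked out) matches here, and it points directly to package P without any vote counting.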
3.3 Implementation in No. 1 ESS 

The general program flow for implementing the phase dictionary is 
shown in Fig. 4. The diagnostic data obtained through simulation are 
originally stored on magnetic tapes. Each diagnostic is read into core 
and is divided into many segments, one segment for each test phase. A 
phase dictionary number is then generated for each test phase by 
manipulating diagnostic data in that phase. The phase dictionary 
number computation is performed for each phase of all fault diag- 
nostics. All phase dictionary numbers of the same test phase are 
grouped together and sorted. The sorted listings are then printed to 
form the various chapters of the phase dictionary. 
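
The flow just described (segmentation, phase number computation, grouping, and sorting) might be sketched in Python as below. The sketch rests on several assumptions: diagnostics are strings of '0' (pass) and '1' (fail), phase boundaries are given as a list of lengths, SHA-256 stands in for the number generation of Appendix B, and a chapter entry is produced only for phases containing at least one failing test. All names and data are hypothetical.

```python
import hashlib
from typing import Dict, List, Tuple


def phase_number(segment: str, digits: int = 12) -> str:
    """Assumed stand-in for the phase dictionary number computation."""
    digest = hashlib.sha256(segment.encode("ascii")).digest()
    value = int.from_bytes(digest, "big") % 10**digits
    return f"{value:0{digits}d}"


def split_into_phases(diagnostic: str, phase_lengths: List[int]) -> List[str]:
    """Divide one overall diagnostic into one segment per test phase."""
    segments, start = [], 0
    for length in phase_lengths:
        segments.append(diagnostic[start:start + length])
        start += length
    return segments


def build_chapters(diagnostics: Dict[str, str],
                   phase_lengths: List[int]) -> Dict[int, List[Tuple[str, str]]]:
    """Group (phase number, fault) pairs by phase and sort each chapter."""
    chapters: Dict[int, List[Tuple[str, str]]] = {
        phase: [] for phase in range(1, len(phase_lengths) + 1)
    }
    for fault, bits in diagnostics.items():
        for phase, segment in enumerate(split_into_phases(bits, phase_lengths), start=1):
            if "1" in segment:  # only phases in which some test failed
                chapters[phase].append((phase_number(segment), fault))
    for entries in chapters.values():
        entries.sort()
    return chapters


# Hypothetical data: three phases of 4, 3, and 2 tests.
lengths = [4, 3, 2]
diagnostics = {"f1": "0100" + "000" + "11", "f2": "0000" + "000" + "11"}
chapters = build_chapters(diagnostics, lengths)
```

Faults f1 and f2 share the same phase 3 pattern and therefore receive the same phase number in chapter 3, while only f1 appears in chapter 1.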


3.4 Advantages and Disadvantages 


The major advantage of the phase dictionary over the exact-match 
dictionary is its ability to locate faults whose diagnostics are 
inconsistent. That is, if the inconsistency of test results is confined to a 
small number of test phases, the phase dictionary can still locate 
faults by matching phase dictionary numbers. Further, the use of 
phase dictionary numbers also offers a possibility for identifying mar- 
ginal faults. This can be done by repeatedly exercising (on-line) cer- 
tain phase(s) of the diagnostic tests and then matching the phase 
numbers, since repeatedly exercising all diagnostic tests could be 
extremely costly in terms of system real time. 

[Figure 4: flow diagram. Test data from simulation → data segmentation → phase number computation → phase numbers grouping → Phase 1, Phase 2, ..., Phase n.] 

Fig. 4 Program flow for phase dictionary implementation. 

THE BELL SYSTEM TECHNICAL JOURNAL, FEBRUARY 1967 

The phase dictionary in general tends to be bulkier than the exact-match 
dictionary. This is because each fault is multiply listed in the 
dictionary. For the particular example shown in Fig. 3, a phase 
dictionary is approximately 24 times the size of the exact-match 
dictionary, as can be determined by calculating the average number 
of failing test phases per diagnostic. Another shortcoming of the 
phase dictionary is that if the inconsistency of test results affects 
all test phases or if the phase which the inconsistency affects 
is the only failing phase in the overall diagnostic, the fault cannot 
be successfully identified with this approach. Although wide variations 
in test results for the same fault are less likely to arise in a good 
diagnostic design, they nevertheless represent a problem. 


3.5 The Test Group Dictionary 

A third form of dictionary that uses the idea of masking out incon- 
sistent tests has been investigated. This dictionary was constructed by 
considering each test of the diagnosis independently and determining 
what faults caused this test to fail. Those faults would then be 
grouped and associated with that test. When this was done for all tests, 
the Test Group Dictionary was formed. 
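
The construction can be illustrated with a short Python sketch. It is an assumption-laden illustration only: tests are identified by integers, the ranking by vote count is one possible form of the majority rule, and the fault labels and function names are hypothetical.

```python
from collections import Counter, defaultdict
from typing import Dict, Iterable, List, Set, Tuple


def build_test_group_dictionary(simulated: Dict[str, Iterable[int]]) -> Dict[int, Set[str]]:
    """For each test, collect the faults that cause that test to fail."""
    groups: Dict[int, Set[str]] = defaultdict(set)
    for fault, failing_tests in simulated.items():
        for test in failing_tests:
            groups[test].add(fault)
    return groups


def rank_faults(groups: Dict[int, Set[str]], observed_failing: Iterable[int]) -> List[Tuple[str, int]]:
    """Majority rule: count how many observed failing tests list each fault."""
    votes: Counter = Counter()
    for test in observed_failing:
        votes.update(groups.get(test, ()))
    return votes.most_common()


# Hypothetical simulation data.
simulated = {
    "fault A": [1, 2, 3, 4],
    "fault B": [3, 4, 5],
    "fault C": [9],
}
groups = build_test_group_dictionary(simulated)

# Field result for fault A in which test 2 inconsistently passed.
ranking = rank_faults(groups, [1, 3, 4])
```

In the example, fault A collects three votes and fault B two, so fault A is ranked first even though one of its tests gave an inconsistent result.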

It was originally hoped that the list of faults associated with a 
given test would be small enough on the average so that a majority 
rule technique could be used to advantage when analyzing a field 
diagnostic result. Unfortunately, in all of the cases implemented so 
far, the average listing is well over 30 faults per test. This precludes 
its use as a manual tool. If stored on tape, however, and searched by 
machine, it gives promise as being a valuable laboratory tool. It might 
also be useful if the maintenance facilities for a number of machines 
were centralized. 


IV. CELL DICTIONARY APPROACH 


4.1 Introduction 

The phase dictionary approach is a technique for finding exact 
matches between patterns by eliminating those portions of the patterns 
where differences are found. The “cell” dictionary approach, on the 
other hand, is not concerned with exact matches between patterns but 
with near matches. In the cell approach, the entire fault pattern is 
examined and a measure of its “similarity” to other patterns in the 
dictionary is made. On the basis of this measure, those faults in the 
dictionary associated with the most “similar” patterns are identified as 
being more likely to have caused the observed fault pattern.* This 
then provides the maintenance man with a list, in order of probability, 
of faults to repair. 

* This assumption can only be justified ultimately by satisfactory results in its 
application. 

Clearly, the efficacy of this approach depends on (i) the way in 
which “similarity” is defined and measured, and (ii) the manner in 
which this measure is used. 


4.2 “Similarity” and “Dissimilarity” of Diagnostics 

If every test in a diagnosis was of equal significance in the task of 
isolating faults, one could say that two diagnostic test patterns were 
dissimilar in proportion to the number of corresponding tests in which 
the two patterns differ. If the diagnostic results were expressed as 
binary numbers, their dissimilarity could be expressed as having a 
direct relationship to their Hamming distance. The greater the Ham- 
ming distance, i.e., the more places in which two patterns differ, 
the less alike the patterns are. Conversely, the smaller the Ham- 
ming distance, the more similar two patterns are. Consequently, 
Hamming distance can be used as a measure of both similarity and 
dissimilarity. 

Hamming distance is a readily computable measure. And further, it 
can be easily modified to take into account the fact that diagnostic 
tests differ in significance. For the purposes of explaining the idea of 
significance, consider a small sample of faults and just those tests of 
the diagnosis which fail for at least one fault in that sample. Suppose a 
number of these tests always pass or fail together as a group for any 
given fault. Then the results of the entire group could be predicted by 
examining the result of any one of the tests. Here the information 
provided by each test in the task of locating a fault is the same but 
diluted by a factor equal to the number of tests in the group. One 
would say that the significance of these tests was low. On the other 
hand, a test whose result cannot be predicted by the results of other 
tests would have a higher significance. Intuitively speaking, the signifi- 
cance of a test should also be affected by its consistency. Less signifi- 
cance should be attributed to inconsistent tests. 

A natural and satisfying way of expressing these considerations 
quantitatively would be to assign weights to tests in accordance with 
their significance. Section 4.9 gives a brief discussion of some mathe- 
matical techniques available to do this.® 

Suppose now only that each test has been assigned a weight. Then 
similarity would be computed by summing the weights of the differing 
tests in the patterns. Similarity is, thus, now measured by a weighted 
Hamming distance. Whether or not test weights have been assigned, 
however, similarity is measured by an easily calculated number. 
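
Both measures are simple to compute, as the following Python sketch suggests. The weighting shown (a test belonging to a group of n tests that always pass or fail together receives weight 1/n) is only an assumed example of a significance weight, not the technique of Section 4.9; the function names are hypothetical.

```python
from typing import Sequence


def hamming_distance(a: str, b: str) -> int:
    """Number of corresponding tests in which two patterns differ."""
    if len(a) != len(b):
        raise ValueError("patterns must cover the same tests")
    return sum(x != y for x, y in zip(a, b))


def weighted_hamming_distance(a: str, b: str, weights: Sequence[float]) -> float:
    """Sum of the weights of the tests in which two patterns differ."""
    if not (len(a) == len(b) == len(weights)):
        raise ValueError("patterns and weights must have equal length")
    return sum(w for x, y, w in zip(a, b, weights) if x != y)


# Two hypothetical five-test patterns ('1' = test failed).
dictionary_pattern = "10110"
field_pattern = "10011"

# Assumed weights: tests 3 and 5 belong to groups of two and four tests
# that always pass or fail together, so they carry weights 1/2 and 1/4.
weights = [1.0, 1.0, 0.5, 1.0, 0.25]

plain = hamming_distance(dictionary_pattern, field_pattern)
weighted = weighted_hamming_distance(dictionary_pattern, field_pattern, weights)
```

The two patterns differ in tests 3 and 5, giving a plain distance of 2 but a weighted distance of only 0.75, since both differing tests are of low significance under the assumed weights.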
4.3 A Problem 


It would be possible to store the dictionary (i.e., all the fault pat- 
terns) in a computer memory and to use the similarity measure just 
described to search the dictionary for patterns similar to some arbi- 
trary pattern when desired. As a practical matter, however, the time 
required for the search and the storage required for the dictionary are 
prohibitive. In the case of No. 1 ESS, for example, it is estimated that 
about 2.5 < 10% 36-bit words would be required to store the dictionary 
for the central control alone. The time required for a single search 
would be on the order of 10 minutes provided the dictionary were 
stored in core (rather than on tape or disk) and provided the central 
control did no other work. Since No. 1 ESS is a time-shared machine, 
no one job is run for longer than 10 percent of any extended interval. 
Consequently, a single search would run well over one hour. Clearly, 
a more sophisticated method is needed. 


4.4 A Geometric Model for the Dictionary 


Consider, for the time being, that diagnostic results are representable 
by binary numbers and that similarity is measured by simple Hamming 
distance. It is possible to construct a geometric analog of the binary 
number system in binary space. In this analog, each binary number 
represents a unique point in the space and the “distance” between points 
corresponds to the Hamming distance between the patterns representing 
those points. Thus, there are exactly 2^N distinct points or patterns in 
an N-dimensional space. 

Suppose that all of the data in the dictionary were placed in some 
N-dimensional space. For example, the dictionary data for the No. 1 
ESS central control consists of about 10^5 different patterns of order 5000 
(i.e., N equals 5000). Thus, 10^5 points out of a possible 2^5000 ≈ 10^1505 
points would be taken up by dictionary data. (The space is very sparsely 
populated.) Now suppose some arbitrary pattern (produced by some 
real fault) is placed at point A in the N-space. If point A is already 
occupied by a dictionary pattern, then the two patterns match and the 
real fault is almost certainly the same as the dictionary fault associated 
with that pattern. If, however, point A is unoccupied, we have (pre- 
sumably) the case of an inconsistency. Then it would follow, since 
Hamming distances are preserved in the analog, that those dictionary 
points that are closest spatially will be the most similar ones according 
to our definition. The faults associated with these nearby points would 
then represent those dictionary faults which could most probably have 
produced the inconsistent pattern. 
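
The nearest-point idea can be illustrated with a minimal sketch. It assumes patterns are held as Python integers (one bit per test); the three-entry dictionary, the package names, and the function names are hypothetical and serve only to show the distance ranking, not the No. 1 ESS implementation.

```python
def hamming(a: int, b: int) -> int:
    """Number of bit positions in which two patterns differ."""
    return bin(a ^ b).count("1")


def nearest_faults(pattern, dictionary, how_many=3):
    """Return up to how_many (distance, fault) pairs closest to pattern."""
    ranked = sorted((hamming(pattern, p), fault) for p, fault in dictionary.items())
    return ranked[:how_many]


# Hypothetical dictionary for a diagnosis of 8 tests: pattern -> fault identity.
dictionary = {
    0b11110000: "package A",
    0b00001111: "package B",
    0b10101010: "package C",
}

observed = 0b11100000  # point A: not an exact match for any dictionary pattern
result = nearest_faults(observed, dictionary, how_many=2)
print(result)
```

An exhaustive ranking of this kind is exactly the search that Section 4.3 found too slow for the full dictionary; the sketch is meant only to make the distance measure concrete.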


4.5 The Cell Dictionary Concept


Imagine first that a number of points, $c_i$ ($i = 1, 2, \ldots, m$), in the
N-space are selected arbitrarily. Now imagine that every point, $x_k$, in
the space is associated with a $c_i$ such that the Hamming distance
$d(c_i, x_k)$ is least. This procedure would then produce a number of
“cells” $C_i$ with “centers” $c_i$, each containing all points $x_k$ such that
$d(c_i, x_k) < d(c_j, x_k)$ for $j \neq i$. Applying this procedure to dictionary
data will result in a Cell Dictionary. The dictionary would consist of an
ordered list of those cells which contained diagnostic results together
with the fault identities corresponding to those results. Each cell can
be conveniently identified by its $c_i$. Such a dictionary could be used as
follows:


(i) Place in memory a list of occupied cells (i.e., cells which have
diagnostic results associated with them).

(ii) Given an arbitrary pattern, a search would consist of computing
the cell containing it and finding the two or three closest occupied cells.

(iii) Likely faults could now be found by consulting the printed
dictionary.


This method would be practical if such cell lists were relatively small. 
Unfortunately, No. 1 ESS data does not lend itself to small lists. There- 
fore, a different form of cell dictionary was adopted. 


4.6 Selection of Practical Cell Center Lists 


In this section, we shall discuss an algorithm which permits the
generation of a list of cell centers ($c_i$’s) that, given an arbitrary pattern,
can be rapidly searched.

Consider all binary numbers of order N (i.e., having N bits) and a
partitioning of the N bits into k equal parts (assuming N is divisible
by k). Suppose that all bits in each part, $P_i$, of the partitioning are
assigned like values: either all 0’s or all 1’s. Then the subset of all
numbers of order N meeting the above requirements can be placed into
one-to-one correspondence with the set of all binary numbers of order
k. This subset can then be taken to form a set of $2^k$ cell centers which
divides the N-space into $2^k$ equal-sized (i.e., containing the same number
of points) cells.

To show that every point is contained in some cell and every cell
contains the same number of points, imagine an arbitrary pattern of
order N is divided into k equal parts. Then all binary numbers having
more 0’s than 1’s within part $P_i$ of this pattern are closer Hamming
distance-wise to a cell center whose $P_i$ is all 0’s. Similarly, all binary
numbers having more 0’s (1’s) than 1’s (0’s) within other parts, $P_j$’s, of
this pattern are closer to a cell center whose $P_j$’s are all 0’s (1’s). It
follows, therefore, from the definition of a cell that each point is contained
in some cell. Moreover, if $N/k$ is odd, each point will be contained in one
and only one cell.

Since, in general, there are $C_n^m$ possible binary patterns of order n with
m 0’s (or 1’s),* the number of binary numbers having more 0’s than 1’s
in $P_i$ equals the number of binary numbers having more 1’s than 0’s.
Since this holds for any $P_i$, and further it holds independently of any
other $P_j$, it holds for the entire partition. Thus, each cell center will
have exactly the same number of points closer to it than to any other
cell center.

From the nature of the construction of cell centers, a binary number
can be assigned to a cell merely by partitioning it and counting either
the 0’s or 1’s in each part. This greatly simplifies and speeds up the
assigning of patterns to cells. Furthermore, cell sizes can be varied at
will merely by changing the partitioning. Very large cells will be
generated if the $P_i$’s are large and vice versa.
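
The assignment rule just described (partition the pattern and take the majority value in each part) can be sketched as follows. The function names and the 9-bit example are illustrative assumptions; the sketch requires an odd part size so that no ties occur.

```python
def cell_name(bits, part_size):
    """Map an N-bit pattern (list of 0/1) to the k-bit name of its cell.

    Each part of part_size bits contributes one bit: the majority value.
    part_size (= N/k) must be odd and must divide len(bits).
    """
    if part_size % 2 == 0 or len(bits) % part_size != 0:
        raise ValueError("part_size must be odd and divide the pattern length")
    name = []
    for start in range(0, len(bits), part_size):
        ones = sum(bits[start:start + part_size])
        name.append(1 if ones > part_size // 2 else 0)
    return name


def cell_center(name, part_size):
    """Expand a k-bit cell name back into its N-bit cell center."""
    return [bit for bit in name for _ in range(part_size)]


pattern = [1, 1, 0, 0, 0, 1, 1, 1, 1]   # N = 9, three parts of 3 bits
name = cell_name(pattern, 3)            # majority of each part
center = cell_center(name, 3)
print(name, center)
```

Only counting is involved, which is why the assignment is fast; changing `part_size` changes the cell size without any other change to the procedure.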


4.7 The Multiple Cell Dictionaries Approach 


When these techniques were applied to No. 1 ESS, it was decided to
produce a number of cell dictionaries, each corresponding to a different
cell size. This approach evolved because of the problems encountered
in apparently simpler implementations. For example, as was discussed
in Section 4.5, one way of implementing cell dictionaries in the field
would be first to produce only one cell dictionary from the laboratory
data. Then, a list of those occupied cells (i.e., cells containing dictionary
fault patterns) would be stored in the field ESS machine. This would
enable the machine to take a pattern for a field trouble, compute the
cell containing it, search the cell list, and print out a number of nearby
occupied cells. In the case of No. 1 ESS, however, the list of occupied cells
was very large (requiring on the order of 15,000 36-bit words of memory)
and search times prohibitively long. The next possibility, which was
tested and then discarded, was to print the cell dictionary (i.e., an
ordered list of occupied cells) and search it manually. A search consisted
of checking whether or not a cell containing a real fault was occupied.
If it was not, then a check of the nearby cells was necessary. Thus, the
search required that the No. 1 ESS machine compute and print the cell
containing the real fault and its neighboring cells in N-space.
Unfortunately, the number of nearby cells in N-space can be enormous. For
example, there are, in general, $C_k^d$ adjacent cells a distance $dN/k$ from
any given cell (N and k are as previously defined and d = 1, 2, 3, ...).
Thus, if N = 5001 and k = 1667, the closest cells to the given cell (d = 1)
will be a distance of only 3 away and there will be 1667 of them. For
d = 2, the number of cells will be about $1.4 \times 10^6$. As a result, unless
the cell containing the fault pattern was occupied, the time for finding
any nearby occupied cell could be extremely long. Consequently, a
modification of this idea, the so-called Multiple Cell Dictionaries
approach, was finally adopted.


* $C_m^n$ = the number of combinations of n out of a total of m things.
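
The neighbor counts quoted above can be checked with a short calculation. The function names are illustrative; the figures N = 5001 and k = 1667 are those of the text.

```python
import math


def adjacent_cells(k: int, d: int) -> int:
    """Number of cell centers that differ from a given center in exactly d parts."""
    return math.comb(k, d)


def center_distance(n_bits: int, k: int, d: int) -> int:
    """Hamming distance between two centers that differ in d parts of N/k bits each."""
    return d * (n_bits // k)


N, k = 5001, 1667
for d in (1, 2):
    print(d, center_distance(N, k, d), adjacent_cells(k, d))
```

For d = 2 the count is 1,388,611, which is the "about $1.4 \times 10^6$" of the text.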

Suppose the ESS machine computes the name of the cell that contains
a real fault, and a check in the cell dictionary shows that the cell is
unoccupied. Now suppose the machine repeats a similar calculation to
obtain the name of a larger sized cell. Then, in general, a different
cell name will result. In order to check whether or not this cell is
occupied, a cell dictionary corresponding to this sized cell would have
to be available and searched. Suppose the search was again unsuccessful;
then the machine could compute another even larger cell and
so on, for as many times as there are available dictionaries. Ultimately,
as the cells grow larger, they must become occupied cells for some cell
dictionary.
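
The successive look-ups described above can be sketched as follows. This is a minimal illustration under stated assumptions: the 9-bit patterns, the fault names F1 and F2, and the part sizes 1, 3, and 9 are hypothetical, and the sketch does not omit the zero cell or scramble the cell names as the printed dictionaries do.

```python
def cell_name(bits, part_size):
    """Majority value of each part of part_size bits (part_size must be odd)."""
    return tuple(
        1 if 2 * sum(bits[i:i + part_size]) > part_size else 0
        for i in range(0, len(bits), part_size)
    )


def build_dictionaries(fault_patterns, part_sizes):
    """One cell dictionary per part size: cell name -> set of fault identities."""
    dictionaries = {}
    for size in part_sizes:
        cells = {}
        for fault, bits in fault_patterns.items():
            cells.setdefault(cell_name(bits, size), set()).add(fault)
        dictionaries[size] = cells
    return dictionaries


def look_up(bits, dictionaries):
    """Try the smallest cells first; return (part_size, faults) or None."""
    for size in sorted(dictionaries):
        faults = dictionaries[size].get(cell_name(bits, size))
        if faults:
            return size, faults
    return None


# Hypothetical laboratory data: fault identity -> 9-bit diagnostic result.
fault_patterns = {
    "F1": [1, 1, 1, 0, 0, 0, 1, 1, 0],
    "F2": [0, 0, 0, 1, 1, 1, 0, 0, 0],
}
dictionaries = build_dictionaries(fault_patterns, part_sizes=(1, 3, 9))

observed = [1, 0, 1, 0, 0, 0, 1, 1, 0]   # F1 with one inconsistent test
print(look_up(observed, dictionaries))
```

With a part size of 1 the look-up is an exact match and fails for the inconsistent pattern; the 3-bit partitioning absorbs the single flipped test and implicates F1.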

The computation of cells of different sizes for a real fault is relatively 
simple. In order to use the computed cell names, however, a number of 
corresponding cell dictionaries must be made available. Thus, the mul- 
tiple cell dictionaries approach is a trade-off of bulk for search time. 
The increase in bulk, however, is not so great as might at first be sus- 
pected. This is because as the cell sizes increase the number of cells 
decreases and the number of faults in the “zero” cell* (which is not 
printed in general) increases. A qualitative evaluation of the results 
achieved will be presented in Section 5.1. 

It should be noted that a partitioning such that each $P_i$ consists of
a single bit will result in an exact-match cell dictionary. Thus, the cell
approach can be extended to cover both the case of exact matches and
the case of inconsistencies.

The form of cell dictionaries is exactly the same as the exact-match 
dictionary. This is achieved by scrambling cell center coordinates in 
a fashion similar to the procedure for scrambling diagnostic results 
for the exact-match dictionary (see Section 1.3 and Appendix B). The 
scrambled cell center coordinate serves as the cell name when printing 
the cell dictionary. 


* The “zero” cell is one whose center has an all 0’s pattern.


4.8 Some Disadvantages


Although the major advantages of the cell dictionary are fairly 
obvious, some of its disadvantages may not be. 

First, the smallest practical cell sizes are formed with a partitioning
such that each $P_i$ consists of 3 bits. This means that any pattern that
never groups at least two 1’s within a single $P_i$ will fall in the zero
cell. Since the “all zeroes” diagnostic result represents the healthy 
machine, it could be expected that many faults that cause the machine 
to be “slightly sick” will fall into the zero cell or its neighbors. This is 
indeed the case as revealed in the No. 1 ESS diagnostic data. Thus, a 
fault falling in the smallest zero cell in many cases renders the cell 
dictionary useless because of its poor resolution. Fortunately, the 
situation is ameliorated by the fact that most inconsistencies occur 
with faults which cause many test failures. 

A second disadvantage concerns the process of computing larger cells 
around the real inconsistent fault pattern. It is true that as the parti- 
tioning increases, larger cells containing the real fault are examined 
but two facts should be noted: 


(i) The real pattern will not necessarily be close to the center of
the cell, and
(ii) The centers of these cells will not usually coincide.


These facts mean that some faults not necessarily close to the real 
fault may occasionally be implicated and also that a cell may not be 
completely included in the next larger sized cell. Eventually, of course, 
as cell sizes increase, the smaller cells will be completely included but 
at the cost of a greater number of implicated faults. 

Both of these considerations are affected by the algorithm used to 
select cell centers. A better algorithm might eliminate these problems. 
Also, an approach such as Kruskal’s (see Ref. 8) can reduce if not 
eliminate them. 


4.9 Test Weighting 


The modification of similarity measurements by the inclusion of 
test weights should take the following considerations into account: 


(i) The significance of a test relative to other tests in the task of
isolating faults, and

(ii) The consistency of the test, i.e., the probability that on
multiple diagnoses of the same fault, the test will give the same result
as it gave on previous diagnoses.

Item (i) can be derived from the test results obtained from dictionary
production. Consider a matrix formed such that each row represents
the binary diagnostic result for a fault (see Fig. 5). Then each
column represents the results a given test yields for each of the faults.
To calculate the significance of a test relative to other tests, first
obtain the correlation coefficients $\rho_{ij}$ of column i with every other
(jth) column using standard statistical definitions. The relative
significance of the ith test, which we will call $\sigma_i$, is

    $\sigma_i = 1 \Big/ \sum_{j=0}^{N} \rho_{ij}^2 .$

$\rho$ is squared to obviate sign difficulties and down-weight small
correlations.

[Fig. 5 - Test weights. The figure shows the dictionary matrix (rows:
faults; columns: tests) together with the correlation coefficients
$\rho_{ij}$, the consistency factor (number of consistent faults divided by
total number of faults), the test weight $w_i$, and the weighted Hamming
distance.]


Item (ii) consists of the ratio of the number of faults for which the
test was consistent to the total number of sample faults.* The
consistency factor $c_i$ of test i can be defined as:

    $c_i = \dfrac{\text{total number of faults for which test } i \text{ was consistent}}{\text{total number of faults}}$


* This ratio may be obtained practically by repeatedly performing the
diagnosis on a given fault and observing the results of the tests. This could
be done during physical simulation (at a considerable time cost). Otherwise,
field data would have to be collected over a period of time. Kruskal has an
idea (yet to be verified) of how to predict the inconsistency of a test
theoretically. See Ref. 8.


A test weight $w_i$ then is

    $w_i = f(\sigma_i, c_i).$

A suggested function is

    $w_i = [1 + c_i \log c_i + (1 - c_i) \log (1 - c_i)]\,\sigma_i .$

The quantity $[1 + c_i \log c_i + (1 - c_i) \log (1 - c_i)]$ is the usual
entropy function. It essentially makes $c_i = 1/2$ the zero point (which is the
value we would expect if the test was completely inconsistent) and
spreads the intermediate factors appropriately.
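
The weight calculation can be sketched as follows. Several points are assumptions of this sketch rather than statements of the paper: logarithms are taken to base 2 (the base for which the entropy term vanishes at a consistency factor of 1/2), the sum of squared correlations runs over every column including column i itself, a constant column is given zero correlation and zero significance, and all function names and the small example matrix are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation of two columns; 0.0 if either column is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def significance(matrix, i):
    """sigma_i = 1 / (sum over all columns j of rho_ij squared)."""
    columns = list(zip(*matrix))            # matrix rows are faults
    total = sum(pearson(columns[i], col) ** 2 for col in columns)
    return 0.0 if total == 0 else 1.0 / total


def consistency_term(c):
    """1 + c*log2(c) + (1 - c)*log2(1 - c), taking 0*log2(0) as 0."""
    def xlog2x(x):
        return 0.0 if x == 0 else x * math.log2(x)
    return 1.0 + xlog2x(c) + xlog2x(1.0 - c)


def weight(sigma, c):
    """Suggested test weight: entropy term times relative significance."""
    return consistency_term(c) * sigma


def weighted_hamming(a, b, weights):
    """Sum of the weights of the tests on which two patterns disagree."""
    return sum(w for x, y, w in zip(a, b, weights) if x != y)


# Hypothetical dictionary matrix: 4 faults (rows) by 3 tests (columns).
matrix = [
    [1, 1, 0],
    [1, 1, 1],
    [0, 0, 1],
    [0, 0, 0],
]
sigmas = [significance(matrix, i) for i in range(3)]
print(sigmas)
```

In the example the first two tests always agree, so each receives half the significance of the third, uncorrelated test.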

This formulation of test weights is due to J. B. Kruskal. Ref. 8, 
Section VII contains an elegant discussion of test weights excluding 
inconsistency considerations. 

Test weighting has not, as yet, been used with No. 1 ESS data. The
process of calculating 5000 × 4999 ÷ 2 correlation coefficients (assuming
5000 related tests in a diagnosis) is very time consuming.
However, results obtained without test weights (i.e., every test given
the same weight) appear quite satisfactory.


V. RESULTS AND CONCLUSIONS 


5.1 Dictionary Evaluation Results 

An experiment was conducted to evaluate the effectiveness of the
No. 1 ESS exact-match dictionary, which is being used in the field,
and the phase and cell dictionaries, which are being implemented. A
sample of faults was inserted in the central control at a field No. 1
ESS office located in Chase, Maryland. The sample, which consisted of
302 central control faults, was selected by persons who were not
involved in the dictionary production project so as to reduce the
possibility of bias in the selection. The faults selected were well distributed
with respect to their types and locations, and were representative of
expected troubles. Each fault was inserted while the office was running
under a simulated traffic load, and a diagnosis was performed. The
corresponding diagnostic printout had three possible outcomes: (i) a
printout that matched the correct dictionary number in the exact-match
dictionary, (ii) a printout which did not match the dictionary
number in the exact-match dictionary, or (iii) a printout indicating
all-tests-passed.


TABLE I - EXACT-MATCH DICTIONARY EVALUATION OF FAULTS
INSERTED IN No. 1 ESS OFFICE AT CHASE, MARYLAND

Faults inserted                                              302

(i)   Found in exact-match dictionary                        216 (72.2%)
(ii)  Produced diagnostic printout, but not found in
      exact-match dictionary                                  58 (19.4%)
(iii) Produced all-tests-passed printout                      25 ( 8.4%)
(iv)  Produced invalid data for analysis                       3


Of the 302 faults inserted, there were 216 faults that were success- 
fully located by the exact-match dictionary, 58 faults producing a 
printout not found in the dictionary, 25 faults producing all-tests- 
passed printouts, and 3 faults whose test results were either invalid 
or incomplete for this analysis due to errors made in the fault insertion 
procedure. The evaluation results for the exact-match dictionary are 
shown in Table I. Since the diagnostic programs were designed under a 
pressing time schedule with little opportunity for the feedback of 
evaluation results, we consider these figures quite gratifying. 

Further analysis revealed that out of 25 faults producing all-tests-passed
printouts, 5 were faults that could not be detected by programs
because of inherent circuit redundancy and 20 were faults that were
not detected due to test inadequacy. Out of 58 faults producing
printouts which did not match any dictionary numbers in the dictionary,
9 were faults whose diagnostic data had not been simulated during the
dictionary production process and 9 were faults whose data were
incomplete for the purpose of analysis with phase or cell dictionaries. Thus,
only the data of the remaining 40 faults were analyzed to illustrate the
feasibility and the effectiveness of phase and cell dictionaries. As
shown in Table II, only 2 faults could not be located; all other 38
faults were found in either phase or cell dictionaries. In addition, 80
percent of the faults located by the cell dictionary were isolated to
10 or fewer circuit packages. (For this evaluation, only 5 sections of
the multiple cell dictionary were constructed. The partitioning of the
diagnostic for each section was 3, 5, 11, 21, and 41 bits, respectively.)
This is indeed a significant improvement. However, it must also be
cautioned that this sample of faults producing inconsistent test results
is too small to draw any meaningful quantitative conclusions. What
can be said is that significant improvements over present exact-match
techniques can be expected and that the cell dictionary approach may
be somewhat superior to the phase dictionary approach.


TABLE II - PHASE AND CELL DICTIONARY EVALUATION OF FAULTS
INSERTED IN No. 1 ESS OFFICE AT CHASE, MARYLAND

Faults producing inconsistent test results                    40

(i)   Found in phase dictionary                               [illegible]
(ii)  Found in cell dictionary                                35
(iii) Found in either phase or cell dictionaries              38
(iv)  Found nowhere                                            2

The exact-match dictionary has been in use in the No. 1 ESS office in
Succasunna, New Jersey, since May 30, 1965. Its effectiveness has been
broadly consistent with our evaluation results. The machine can
usually be diagnosed and repaired within twenty minutes when the
dictionary look-up procedure is successful. The phase and cell
dictionaries will be implemented for the No. 1 ESS office in Beverly Hills,
California, which will begin service sometime in the fall of 1966. The
detailed field performance of all these dictionaries is beyond the
scope of this paper.


5.2 Concluding Remarks 

The major advantage of these techniques, as a whole, is that they
provide the maintenance craftsman with rapid methods for extracting
the information from diagnostic test patterns for the purpose of fault
location. The techniques require a very modest amount of machine
time and memory. They can be quite effective, especially if some care
is taken during the fault simulation phase of dictionary production.

Each technique has its limitations, however. The “exact-match”
dictionary will not handle inconsistencies. The phase dictionary will be 
of assistance if at least one phase is consistent, but at the cost of reso- 
lution and time consumed while manually searching for fault identities 
using the majority rule approach. The phase prime dictionary will 
eliminate the manual search but will work only if the inconsistency is 
confined to a single phase. The cell dictionary is ineffective when the 
fault falls into the zero cell. 

We think that the results obtained will be fairly typical of what can 
be expected when implementing a maintenance dictionary approach on 
a digital machine. The combination of techniques is not perfect but is 
one of the most powerful for locating faults in real-time systems. 

A logical continuation of this work would probably involve: 


(i) A study of why tests are inconsistent. This would permit a
better technique for eliminating inconsistencies from test patterns
when generating phase dictionaries. It would also permit modifications
of the measure of “similarity” between fault patterns.

(ii) Development of better cell center algorithms for the particular
form of cell dictionary described. An “ultimate” cell dictionary is
probably an implementation of Kruskal’s ideas (see Ref. 8).

(iii) Development of criteria for establishing figures of merit for dictionary
techniques which take into account: (a) the percentage of real troubles
located (as compared to simulated faults), (b) the resolution, (c) the
speed or facility with which dictionary look-ups can be made, and
(d) the machine time and memory processing requirements.


Some progress has already been made on items (12) and (22) but 
considerably more is needed. 


VI. ACKNOWLEDGMENTS 


The authors would especially like to acknowledge the work of J. B. 
Kruskal whose ideas were the stimulus for much of the work on the 
cell dictionary. We are also indebted to F. M. Goetz for the initial 
suggestions on “pseudo-random” mapping methods and the phase 
prime dictionary. The programming skills of R. E. Archer and J. L. 
Kodner were invaluable, as well as the comments of R. L. Campbell, 
R. W. Downing, L. S. Tuomenoksa, and many of our colleagues.


APPENDIX A 


No. 1 ESS System Organization and Maintenance Plan 


No. 1 ESS is a general purpose electronic telephone switching machine
which employs for its control a time-shared multiple-program
computer operating in real time.? Functionally, the system can be 
divided into a central processor and a peripheral system (see Fig. 6). 
The central processor, which operates with a 5.5-microsecond cycle time,
provides the data processing facility for telephone, maintenance and 
administrative functions. It consists of program stores, call stores, and 
a central control. The program store, which is a read-only type of 
semi-permanent memory, contains the stored program and translation 
information that are needed to switch calls and provide services as 
well as maintenance programs. The call store, which is a temporary 
ferrite sheet memory, stores all transient information for processing 
calls, such as the digits dialed by the subscriber or the busy-idle states 
of lines and trunks. The central control consists mainly of wired logic. 
Its duty is to coordinate and command all system operations. 

[Fig. 6 - No. 1 ESS block diagram. Blocks shown include the program
stores and call stores.]


The peripheral system consists of a ferreed switching network, 
scanners and distributors, and a master control center. The switching 
network provides the connections between lines, trunks, and service 
circuits (i.e., auxiliary devices such as tone sources, signal receivers, 
and signal transmitters). The scanners are used to collect information 
from lines, trunks, and points internal to the system. The distributor 
is made up of two units: the central pulse distributor, which operates 
at machine cycle speed, is used for signaling logic circuits, whereas, 
the signal distributor is used for controlling slower devices such as 
relays in trunk circuits. The various subsystems are interconnected 
with balanced ac coupled bus systems. The master control center 
includes a teletypewriter for input-output, a panel for manual testing 
of lines and trunks, and some controls and displays. 

The No. 1 ESS was designed for a high degree of maintainability 
and dependability. These objectives require that telephone service is 
not interrupted even in the presence of internal component failures.? 
To achieve these objectives, the major units of the central processor 
are duplicated, and circuit and program facilities are provided for 
detecting troubles, locating the faulty subsystems and re-establishing
an operational configuration without interrupting telephone service. 
Once a faulty subsystem is identified, the diagnostic programs are called 
in to analyze the trouble. These programs are executed (on-line) by 
the operational configuration; they are segmented and interleaved 
with the main call processing program to avoid interference with the 
normal system operation. The diagnostic programs carry out a fixed 
sequence of tests by observing the normal outputs of the faulty 
subsystem or by monitoring some special test points strategically 
embedded within the unit. The combinational testing approach is used
to conserve program storage space and to simplify the processing of 
test results. The pass or fail test data are recorded and processed 
using the exact-match techniques discussed previously (in Section 1.3
and Appendix B) to produce a compact diagnostic printout on the 
teletypewriter. The translation of a diagnostic printout into the loca- 
tion of the replaceable faulty circuit package(s) is then accomplished 
with the aid of a dictionary. The data for the dictionary are generated 
through hardware simulation, i.e., by actually inserting almost every 
possible simple hard fault sequentially into the unit and then recording 
its diagnostic. The general approach of dictionary production is similar 
to the one on Morris machine® and therefore, will not be described 
here. A sample format of exact-match dictionary entries is illustrated 
in Fig. 2. 


APPENDIX B 


Pseudo-Random Number Generation 


In No. 1 ESS, the diagnostic test results of each fault are represented
by an n-bit binary number where each bit or a sequence of bits
designates the pass or fail result of a particular test(s). The number of bits
n is usually very large, e.g., n = 5000. The “hash” technique used to 
reduce No. 1 ESS diagnostic data to a smaller fixed length number 
employs a “pseudo-random” function which manipulates an arbitrary 
and large pattern of test results to produce a number with relatively 
few digits. The reduction procedure is pseudo-random in the sense 
that the mapping is deterministic but approximates the process of 
assigning one truly random number to each fault pattern. 

This process is analogous to the problem of selecting numbers at
random from an urn. Assume the urn has N distinct numbers. A total
of k numbers are selected, one at a time with replacement, from the
urn. The probability P that all these k numbers are distinct is

    $P = \prod_{i=1}^{k-1} \left(1 - \frac{i}{N}\right) \approx \exp\left(-\frac{1}{N} \sum_{i=1}^{k-1} i\right) = \exp\left(-\frac{k(k-1)}{2N}\right)$  for $k \ll N$.


Thus, suppose k denotes the total number of distinct fault patterns
and $d = \lceil \log_{10} N \rceil$ denotes the number of decimal digits in the
diagnostic printout (the symbol $\lceil x \rceil$ denotes the smallest integer greater
than or equal to x). The probability of generating, at least
conceptually, all distinct d-digit numbers from a pseudo-random number
generator can be made arbitrarily large by increasing d. However, in
practice the number of digits d in the printout should be kept
reasonably small so as to simplify the look-up process and to reduce the
dictionary’s bulkiness. Hence, for a sample of $10^5$ distinct fault
patterns, a 12-digit representation would probably suffice, since it yields a
probability of 0.995 that all resultant numbers will be distinct.

When the ratio of k to N is not exceptionally small, a few duplicated
(and replicated) numbers could result. Thus, it is also necessary to
compute the expected number, $L_k$, of replicated pseudo-random
numbers when k fault patterns are assigned random values from a sample
space of size N. Let $E_k$ represent the expected number of distinct numbers
generated. The probability that a particular number is selected
at least once from the urn in k selections is $1 - (1 - 1/N)^k$. This
probability is also equal to $E_k/N$. Thus,

    $E_k = N \left[ 1 - \left( 1 - \frac{1}{N} \right)^k \right] .$

The expected number of replicated numbers becomes

    $L_k = k - E_k = \sum_{i=1}^{k-1} (-1)^{i+1} C_k^{i+1} \left( \frac{1}{N} \right)^i .$

Hence, for a sample of $10^5$ distinct fault patterns, the expected number
of replicated numbers in a 6-digit representation is about 4837 whereas
the expected number of replicated numbers in a 12-digit representation
is less than one.
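
The figures quoted for $10^5$ fault patterns can be reproduced with a short calculation. The sketch below evaluates the exponential approximation for P and the closed form for the expected number of distinct numbers; the function names are illustrative, and `log1p`/`expm1` are used only to avoid round-off when 1/N is very small.

```python
import math


def prob_all_distinct(k: int, n: int) -> float:
    """Approximation exp(-k(k-1)/(2N)) to the product of (1 - i/N), for k << N."""
    return math.exp(-k * (k - 1) / (2 * n))


def expected_distinct(k: int, n: int) -> float:
    """N * (1 - (1 - 1/N)**k), evaluated stably for large N."""
    return -n * math.expm1(k * math.log1p(-1.0 / n))


def expected_replicated(k: int, n: int) -> float:
    """k minus the expected number of distinct numbers."""
    return k - expected_distinct(k, n)


k = 10**5
print(round(prob_all_distinct(k, 10**12), 3))   # 12-digit printout
print(round(expected_replicated(k, 10**6)))     # 6-digit printout
print(expected_replicated(k, 10**12) < 1)       # 12-digit printout
```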

An experiment was performed to verify the hypothesis that this
method of diagnostic data reduction is analogous to the problem of
selecting numbers at random from an urn. A sample of 964 fault
patterns, each consisting of approximately 1000 bits of test results, was
used. Initially, the reduction process used only the addition and
rotation instructions to produce a 6-digit number; nine duplicates
resulted. On subsequent attempts, shift and other peculiar instructions
were added to better scramble the data; the number of duplicates finally
decreased to zero. From the urn analog, the expected number of
duplicates in assigning a 6-digit pseudo-random number to 964 distinct
patterns is 0.5, and the probability of no duplication is 0.61.

The experiment demonstrates the feasibility of number generation
schemes. An effective method can readily reduce each pattern of a
collection of diagnostics to a smaller fixed-length number with very little
loss in resolution. Since the greatest possible total number of fault
patterns for any No. 1 ESS subsystem is about $10^5$, the diagnostic
printout uses a 12-digit representation. As mentioned earlier, this
representation is small enough so that the dictionary look-up process is
easy, yet large enough so that the probability of all generated numbers
being distinct is quite high and the expected number of duplicates is
very small.

Basically, each fault pattern undergoes two stages of reduction.
In the first stage, each binary fault pattern of n bits is ANDed
(bit-wise) with each member of a set of m preselected reference vectors
$R_1, R_2, \ldots, R_m$, and the resultant “bit-sums” $S_1, S_2, \ldots, S_m$ are
collected ($m \ll n$). That is, suppose the binary fault pattern of fault
F is $T_1 T_2 \cdots T_n$ ($T_i$ = 1 or 0), and the pattern of reference vector $R_j$ is
$r_1^j r_2^j \cdots r_n^j$ ($r_i^j$ = 1 or 0). Then the bit-wise ANDing operation will
generate a “bit-sum” $S_j$ where

    $S_j = \sum_{i=1}^{n} T_i \, r_i^j$

and j = 1, 2, ..., m. To further reduce the size of the fault pattern,
each set of bit-sums $S_1, S_2, \ldots, S_m$ undergoes three independent
stages of “data scrambling” manipulation, each resulting in a 4-digit
number. Each stage is a pseudo-random number generation procedure
based on the shift, rotation, and addition orders.

The final diagnostic printout is, therefore, a 12-digit number, formed 
by a concatenation of three 4-digit numbers. Fig. 7 shows the general 
program flow of the final reduction process. 
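
The two-stage reduction can be sketched as follows. The bit-sum step follows the definition of $S_j$ given above. The scrambling step is a hypothetical stand-in: the paper states only that each stage is built from shift, rotation, and addition orders, so the 16-bit rotate-and-add routine, the seeds, and all names below are assumptions chosen for illustration and do not reproduce the No. 1 ESS procedure.

```python
def bit_sums(pattern, reference_vectors):
    """S_j = sum over i of T_i AND r_i^j, one sum per reference vector."""
    return [sum(t & r for t, r in zip(pattern, ref)) for ref in reference_vectors]


def scramble(sums, seed):
    """Hypothetical scrambling stage: 16-bit rotate-left and add, then 4 digits."""
    acc = seed & 0xFFFF
    for s in sums:
        acc = ((acc << 5) | (acc >> 11)) & 0xFFFF   # rotate left by 5 bits
        acc = (acc + s) & 0xFFFF                    # add the next bit-sum
    return acc % 10000


def printout(pattern, reference_vectors, seeds=(1, 2, 3)):
    """Concatenate three independent 4-digit numbers into a 12-digit string."""
    sums = bit_sums(pattern, reference_vectors)
    return "".join(f"{scramble(sums, seed):04d}" for seed in seeds)


# Hypothetical 4-test pattern and three reference vectors.
pattern = [1, 0, 1, 1]
references = [[1, 1, 0, 0], [0, 0, 1, 1], [1, 1, 1, 1]]
sums = bit_sums(pattern, references)
number = printout(pattern, references)
print(sums, number)
```

The mapping is deterministic, so the same fault pattern always yields the same 12-digit number, which is the property the dictionary look-up depends on.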

The final three stage reduction process used was evolved through 
experimentation. The resolution of the dictionary so generated was quite 
high, high enough so that most entries in the dictionary associate with 
only four or fewer circuit packages. For example, in the case of the cen- 
tral pulse distributor,’ which has 3312 simulated faults and whose 

[Fig. 7 - Data reduction. Each scrambling stage (first, second, and
third) produces 4 digits of the printout.]





TABLE III - DICTIONARY RESOLUTION STUDY
(CENTRAL PULSE DISTRIBUTOR)

                                N(i)
    i          1st stage   1st & 2nd stage   All three stages
    1             436            910               924
    2             300            653               663
    3             101             68                67
    4              65             26                26
    5              18              4                 3
    6              49             12                10
    7               5              4                 2
    8              54              1                 1
    9               8              1                 1
   10               7              1                 1
   11               3              1                 1
   12               3              0                 0
   13               0              0                 0
   14               1              0                 0
   15               0              0                 0
   16               1              1                 1
   17 and up        0              0                 0

N(i) = Number of dictionary numbers having i associated packages.

fault pattern has 1150 bits, 15 reference vectors were used. Initially,
only the first stage of the final reduction process (rotation and addition)
was used to produce a 4-digit number. In the resultant dictionary only
59 percent of the entries were associated with four or fewer circuit
packages and the mean was 2.56 packages per entry (see Table III). When the
second stage was added to better scramble the data, 90 percent of the
entries in the dictionary were associated with four or fewer circuit packages
and the mean improved to 1.62 packages per entry. The third stage
was then added, and the resultant dictionary had 95 percent of its entries
associated with four or fewer circuit packages and a mean of 1.58
packages per entry. Further experimentation with the addition of a
fourth stage reduction process did not provide significant improvement
in resolvability. Thus, a three-stage reduction process was adopted.
The dictionary is rather compact; it has only 52 (8½ by 11) pages (for
the central pulse distributor). A sample of the format printout is shown
in Fig. 2. Sample evaluation results of this type of dictionary are
discussed in Section V.
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Dynamic Response of Systems of 
Mutually Synchronized 
Oscillators 


By M. B. BRILLIANT 
(Manuscript received September 8, 1966) 


Recent studies have been concerned with conditions for the stability of 
synchronized systems and expressions for equilibrium frequency. This 
paper describes the transient response of special configurations of
synchronized systems of arbitrary size, as well as frequency response and
jitter response for a few cases. Tentative extrapolations to more general
configurations are suggested. 


I. INTRODUCTION


Recent studies have established the sufficiency of certain rather 
broad conditions for the stability of linear synchronized networks, ’ 
and have shown that valid expressions for the equilibrium frequency 
of such systems can be obtained if initial conditions are taken into 
account. Description of the transient response of such systems is of 
interest, but results for general configurations have not been obtained. 
This paper describes the results of studies of special configurations 
of systems of arbitrary size, and some tentative conclusions about 
more general configurations are suggested. 


II. SYSTEM EQUATIONS 
The equations for the synchronized system will be taken in the 
form used by Gersho and Karafin’ in their (9): 

$$\dot{p}_n(t) = v_n(t) + h_n(t) * \sum_{m=1}^{N} a_{nm}\,[p_m(t - \tau_{nm}) - p_n(t)], \qquad n = 1, \cdots, N \tag{1}$$


(where the star denotes convolution). In Laplace transformed form, 
assuming zero initial conditions, 




320 THE BELL SYSTEM TECHNICAL JOURNAL, FEBRUARY 1967 


$$sP_n(s) = V_n(s) + H_n(s) \sum_{m=1}^{N} a_{nm}\,[P_m(s) e^{-s\tau_{nm}} - P_n(s)], \qquad n = 1, \cdots, N. \tag{2}$$


In these equations, $p_n(t)$ is the phase of the oscillator at the $n$th station,
$v_n(t)$ is the free-running frequency of that oscillator with the effects
of local disturbances added, $h_n(t)$ is the impulse response of a control
filter at the $n$th station, $\tau_{nm}$ is the delay on the transmission link from
the $m$th station to the $n$th, and $a_{nm}$ is an averaging coefficient
associated with that link. The coefficients are normalized so that

$$\sum_{m=1}^{N} a_{nm} = 1. \tag{3}$$

Normally, $a_{nn}$ is zero. The filter gain $H_n(s)$ has the dimensions of inverse
time; its zero-frequency value, assumed to be nonnegative, is

$$H_n(0) = \lambda_n. \tag{4}$$


These equations, as pointed out by Gersho and Karafin, are conformable 
with the linear equations used by Karnaugh’ if $v_n(t)$ is understood to
include not only the free-running frequency of the oscillator at the 
nth station but also the sum of the transient disturbances at that 
station as well as some initial condition terms. 

The assumption of zero initial conditions in (2) depends on the 
following simplifying procedure. Since only dynamic responses are to 
be studied, and since the equations are linear, the steady-state solution 
can be subtracted from the total response. Thus, $v_n(t)$ and $p_n(t)$ will
be taken to represent only the disturbance component. Where the
disturbance is transient, the $v_n(t)$ will be assumed to have zero values
before the disturbance begins, and the initial phases will be taken as 
zero. The result of this procedure shows only the response to the dis- 
turbance, to which the steady-state solution would have to be added 
to determine the total frequencies and phases. 

Although formal results for arbitrary filters and arbitrary delays
will be obtained in a few cases, emphasis will be placed on the simple
case of flat filters $H_n(s) = \lambda_n$ (in effect, no filters) and zero transmission
delays. In this case, the filter gains $\lambda_n$ determine the time scale of the
response. There seems to be no compelling practical reason to make
the $\lambda_n$ much greater than the reciprocal of a second. The response time
can then be assumed to be large compared with the transmission
delays expected in most cases as well as large enough so that the
response would not be severely affected by the inherent low-pass filter
effects of ordinary electromechanical control elements. The results are
sufficiently encouraging, from a practical standpoint, to suggest that
it may not be necessary to incorporate filtering by design, so that the
simple case appears to have some practical value.
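
For the simple case of flat filters and zero delays, equation (1) reduces to a set of ordinary differential equations that can be integrated directly. The sketch below is a minimal illustration, not part of the original analysis: the function name `simulate`, the fixed-step Runge-Kutta scheme, and the step count are all assumptions. An impulse in frequency at station 1 is represented, as in the later sections, by an initial phase of 1 at that station.

```python
def simulate(a, lam, p0, t_end, steps=2000):
    """Integrate dp_n/dt = lam_n * (sum_m a[n][m] * p_m - p_n) by classical RK4.

    a   : averaging coefficients, each row summing to 1 as in (3)
    lam : flat filter gains lambda_n
    p0  : initial phases (an impulse at station 1 is p0 = [1, 0, ..., 0])
    """
    n = len(p0)

    def deriv(p):
        return [lam[i] * (sum(a[i][m] * p[m] for m in range(n)) - p[i])
                for i in range(n)]

    h = t_end / steps
    p = list(p0)
    for _ in range(steps):
        k1 = deriv(p)
        k2 = deriv([p[i] + 0.5 * h * k1[i] for i in range(n)])
        k3 = deriv([p[i] + 0.5 * h * k2[i] for i in range(n)])
        k4 = deriv([p[i] + h * k3[i] for i in range(n)])
        p = [p[i] + h * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i]) / 6
             for i in range(n)]
    return p


# Two stations with gains 1 and 3: both phases settle at the same final value.
final = simulate([[0.0, 1.0], [1.0, 0.0]], [1.0, 3.0], [1.0, 0.0], 10.0)
```

With these gains the two phases converge to a common value, which is the behavior the two-station analysis later derives in closed form.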


III. AN ELECTRICAL ANALOG, RECIPROCITY 


Although explicit transient responses have been derived only for 
specific system configurations, it is possible to state, for systems of 
arbitrary configuration, a condition sufficient to guarantee that the 
transient response is not oscillatory. This condition is a reciprocity 
condition derived from the properties of a passive electrical network 
analog. 


3.1 Case 1: $\tau_{nm} = 0$, $H_n(s) = \lambda_n$


Consider an electrical network as shown in Fig. 1, having $N$ nodes
in addition to a ground node. A capacitor $C_n$ is connected from the
$n$th node, $n = 1, \cdots, N$, to ground, and a resistor $R_{nm} = R_{mn}$ is
connected between some, not necessarily all, pairs of nodes $n$, $m$. A current
source delivers current from ground into each node. The Laplace-
transformed node equations are

$$sE_n(s) = \frac{1}{C_n} I_n(s) + \sum_{m=1}^{N} \frac{1}{R_{nm} C_n}\,[E_m(s) - E_n(s)]. \tag{5}$$


Fig. 1 — Part of the electrical analog of a reciprocal system with flat filters and
zero delays.

These equations are similar in form to (2) representing a synchronized
system, and can be identified with them if the $n$th node of the electrical
network is identified with the $n$th station of the synchronized system,
and

$$E_n(s) = P_n(s), \tag{6a}$$

$$I_n(s) = C_n V_n(s), \tag{6b}$$

$$\frac{1}{R_{nm}} = a_{nm}\lambda_n C_n, \tag{6c}$$

$$\tau_{nm} = 0, \tag{6d}$$

$$H_n(s) = \lambda_n. \tag{6e}$$

Note that $V_n(s)$ is not a voltage, but a reference frequency.
The reciprocity condition

$$R_{nm} = R_{mn} \tag{7}$$

imposes a condition on the averaging coefficients and filter gains in
addition to the simplifying conditions of flat filters and zero delays.
This condition immediately gives, from (6c),

$$a_{nm}\lambda_n C_n = a_{mn}\lambda_m C_m. \tag{8}$$


The capacitances $C_n$ are to a certain extent arbitrary, in that if
a system has an analog with given $C_n$, equivalent analogs can be formed
by multiplying all the $C_n$ by any common factor and rescaling the other
elements. The capacitance at the node corresponding to any one selected
station can therefore be chosen arbitrarily; (8) shows how the
capacitances for stations to which it is connected can then be derived using
only parameters of the synchronized system:

$$C_m = \frac{a_{nm}\lambda_n}{a_{mn}\lambda_m}\,C_n. \tag{9}$$

For a station that is connected to the selected one by a path of $M$
links, via $M - 1$ intermediate stations, iteration gives a formula of
the form

$$C_{n_M} = \frac{a_{n_0 n_1} a_{n_1 n_2} \cdots a_{n_{M-1} n_M}\,\lambda_{n_0}}{a_{n_1 n_0} a_{n_2 n_1} \cdots a_{n_M n_{M-1}}\,\lambda_{n_M}}\,C_{n_0}, \tag{10}$$

where $n_0$ is the index of the selected station and $n_k$ is the index of the
$k$th station in sequence along the path.

Unambiguous determination of the $C_n$ requires that if two or more
paths exist between two stations, the formula (10) should give the
same result on all paths. This is equivalent to the condition that the
product of the averaging coefficients taken counterclockwise around
any closed loop must equal the product of the averaging coefficients
taken clockwise around the loop:

$$a_{n_1 n_2} a_{n_2 n_3} \cdots a_{n_M n_1} = a_{n_1 n_M} \cdots a_{n_3 n_2} a_{n_2 n_1}. \tag{11}$$


This condition will be called the reciprocity condition for synchronized 
systems; a system that satisfies this condition will sometimes be called 
a reciprocal system. It is easily seen that the reciprocity condition 
is both necessary and sufficient for the existence of a passive electrical 
analog of the form of Fig. 1, assuming that the conditions of flat filters 
and zero delays are also satisfied. 
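
One way to test a given set of averaging coefficients for reciprocity is to attempt the construction just described: fix one capacitance, propagate the others along the links with (9), and check that every link agrees. The sketch below assumes a connected network and uses hypothetical names (`analog_capacitances`, `tol`); it returns the capacitances when the construction succeeds and `None` when some loop violates (11) or a link is one-way.

```python
from collections import deque


def analog_capacitances(a, lam, tol=1e-9):
    """Capacitances of the RC analog (C[0] = 1), or None if not reciprocal.

    a[n][m] is the averaging coefficient for the link from station m to n,
    lam[n] is the flat filter gain at station n.
    """
    n = len(a)
    c = [None] * n
    c[0] = 1.0                      # arbitrary choice for the selected station
    queue = deque([0])
    while queue:
        i = queue.popleft()
        for m in range(n):
            if m == i or (a[i][m] == 0 and a[m][i] == 0):
                continue            # no link between i and m
            if a[i][m] == 0 or a[m][i] == 0:
                return None         # one-way link: no resistor can model it
            cm = a[i][m] * lam[i] * c[i] / (a[m][i] * lam[m])  # equation (9)
            if c[m] is None:
                c[m] = cm
                queue.append(m)
            elif abs(c[m] - cm) > tol * max(c[m], cm):
                return None         # two paths disagree, so (11) fails
    return c
```

A symmetric three-station ring passes the test, while a ring whose clockwise coefficients differ from its counterclockwise ones, or a unilateral ring, does not.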

Since the poles of an RC network response function are all simple 
and lie on the negative real s-axis,” its transient response consists 
entirely of real exponential components. It follows immediately that 
a reciprocal system with flat filters and zero delays cannot have an 
oscillatory transient response. Moreover, errors in parameter values 
that cause small departures from reciprocity cannot immediately 
result in the appearance of oscillatory components. Such components 
are represented by conjugate pairs of complex poles; since the pole 
locations are continuous functions of the parameter values, no pole 
can move off the real axis until it has first moved along the axis and 
joined another real-axis pole to form a double pole, assuming that the 
departure from the reciprocal ideal is not of such form as to add new 
poles.


3.2 Case 2: $\tau_{nm}$ small, $H_n(s)$ nearly flat


This conclusion is strictly true only for zero delays and flat filters.
However, it may be expected that delays much smaller than the system
response time, or filters that are nearly flat up to frequencies much
larger than the reciprocal of the response time, will have little effect
on the transient response. In fact, it can be shown in specific cases
that the addition of any delay, however small, introduces an infinite
number of oscillatory components, which nevertheless are small in
amplitude and rapidly damped so that their total effect is small. It
may be assumed that the omission of delays and high frequency cutoffs
is comparable to the neglect of the same parameters in ordinary circuit
analysis.

It is not necessary that the filters be flat in order that the system
have an electrical analog. The resistors can be replaced by any 2-terminal
networks so as to simulate any filter that has a "positive real"
frequency response function. If the transfer function of the filter can be
synthesized as the admittance of a network of resistors and capacitors
only, the analog will still be an RC network and the transient response
obviously remains nonoscillatory.

While this study is nominally confined to dynamic behavior, the
Appendix shows how the reciprocity condition simplifies the steady-
state analysis.

V. E. Beneš has pointed out that if the $a_{nm}$ are considered as the
transition probabilities of a Markov process, as in his original study 
(unpublished work, 1959) of stability and equilibrium frequency, the 
reciprocity condition introduced here is equivalent to the condition 
of reversibility of the Markov process, which in turn is related to 
detailed balance in statistical mechanics. 


IV. TWO-STATION SYSTEMS 


The analysis of a system of two stations offers not only an introduction 
to the techniques of analysis but also an example of the behavior of 
small systems for comparison with the behavior of the large systems 
to be described in later sections. 

An impulse disturbance of frequency is assumed to occur at one of 
the stations, which we then designate (without loss of generality) 
as station 1. This form of disturbance can be interpreted as a brief 
rise in frequency which is almost immediately corrected, leaving a 
residual phase error of one unit of phase. Alternatively, it could rep- 
resent any disturbance that gives rise to the sudden appearance of a 
phase error. 

The system equations, from (2), are

$$\begin{aligned}
sP_1(s) &= 1 + H_1(s) e^{-s\tau_{12}} P_2(s) - H_1(s) P_1(s), \\
sP_2(s) &= H_2(s) e^{-s\tau_{21}} P_1(s) - H_2(s) P_2(s).
\end{aligned} \tag{12}$$

These equations are easily solved to give

$$\begin{aligned}
P_1(s) &= \frac{s + H_2(s)}{s^2 + s[H_1(s) + H_2(s)] + H_1(s) H_2(s)[1 - e^{-s(\tau_{12} + \tau_{21})}]}, \\
P_2(s) &= \frac{H_2(s) e^{-s\tau_{21}}}{s^2 + s[H_1(s) + H_2(s)] + H_1(s) H_2(s)[1 - e^{-s(\tau_{12} + \tau_{21})}]}.
\end{aligned} \tag{13}$$

The final value theorem gives

$$p_1(\infty) = p_2(\infty) = \frac{\lambda_2}{\lambda_1 + \lambda_2 + \lambda_1 \lambda_2 (\tau_{12} + \tau_{21})} \tag{14}$$

as the ultimate displacement in phase caused by the disturbance. The
equality of the two final values, signifying no net change in the phase
difference between the two stations, is a necessary consequence of the
uniqueness of the steady-state solution.


4.1 Case 1: $H_n(s) = \lambda_n$, $\tau_{nm} = 0$.


More explicit results for the transient response are obtained in the
special case of flat filters and zero delays. The transforms become
simple enough for inversion by inspection; the result in the time domain
is

$$\begin{aligned}
p_1(t) &= \frac{\lambda_2}{\lambda_1 + \lambda_2} + \frac{\lambda_1}{\lambda_1 + \lambda_2}\, e^{-(\lambda_1 + \lambda_2)t}, \\
p_2(t) &= \frac{\lambda_2}{\lambda_1 + \lambda_2}\,[1 - e^{-(\lambda_1 + \lambda_2)t}].
\end{aligned} \tag{15}$$

These equations indicate a simple exponential approach to the final
value, starting with initial phases [immediately after the impulse in
$v_1(t)$] of 1 at the first station and 0 at the second. Such behavior appears
satisfactory for a practical system.


4.2 Case 2: $H_n(s) = \lambda$, $\tau_{nm} = \tau$.


To determine the effect of delays, the system will be made as simple
as possible in other respects. The filters will be assumed flat with equal
gains, and the delays will be assumed equal. In this case, the solution
(13) becomes

$$\begin{aligned}
P_1(s) &= \frac{s + \lambda}{s^2 + 2\lambda s + \lambda^2 - \lambda^2 e^{-2s\tau}}, \\
P_2(s) &= \frac{\lambda e^{-s\tau}}{s^2 + 2\lambda s + \lambda^2 - \lambda^2 e^{-2s\tau}}.
\end{aligned} \tag{16}$$

The denominator can be factored, and a partial expansion in partial
fractions gives

$$\begin{aligned}
P_1(s) &= \tfrac{1}{2}[Q_1(s) + Q_2(s)], \\
P_2(s) &= \tfrac{1}{2}[Q_1(s) - Q_2(s)],
\end{aligned} \tag{17}$$

where

$$\begin{aligned}
Q_1(s) &= \frac{1}{s + \lambda - \lambda e^{-s\tau}}, \\
Q_2(s) &= \frac{1}{s + \lambda + \lambda e^{-s\tau}}.
\end{aligned} \tag{18}$$

One approach to the inversion of these transforms is to divide
numerator and denominator by $s + \lambda$ and treat the result as the summation
of a geometric series. Expansion of the series gives

$$\begin{aligned}
Q_1(s) &= \sum_{m=0}^{\infty} \frac{\lambda^m e^{-ms\tau}}{(s + \lambda)^{m+1}}, \\
Q_2(s) &= \sum_{m=0}^{\infty} \frac{(-\lambda)^m e^{-ms\tau}}{(s + \lambda)^{m+1}},
\end{aligned} \tag{19}$$

from which, by (17),

$$\begin{aligned}
P_1(s) &= \sum_{k=0}^{\infty} \frac{\lambda^{2k} e^{-2ks\tau}}{(s + \lambda)^{2k+1}}, \\
P_2(s) &= \sum_{k=0}^{\infty} \frac{\lambda^{2k+1} e^{-(2k+1)s\tau}}{(s + \lambda)^{2k+2}}.
\end{aligned} \tag{20}$$

Inversion term by term gives

$$\begin{aligned}
p_1(t) &= \sum_{k=0}^{[t/2\tau]} \frac{\lambda^{2k} (t - 2k\tau)^{2k} e^{-\lambda(t - 2k\tau)}}{(2k)!}, \\
p_2(t) &= \sum_{k=0}^{[(t-\tau)/2\tau]} \frac{\lambda^{2k+1} [t - (2k+1)\tau]^{2k+1} e^{-\lambda[t - (2k+1)\tau]}}{(2k+1)!},
\end{aligned} \tag{21}$$

where the square bracket in the limit of summation (but only there)
denotes the integer part of the enclosed expression. This result can
be numerically evaluated term by term if the product $\lambda\tau$ is known.
It gives an exact result (for the assumed model) up to a time depending
on the number of terms evaluated. Fig. 2 shows a graph of the
calculated results for $\lambda\tau = 0.1$, that is, delay equal to one-tenth of the
reciprocal of the filter gain.

The interpretation of this result is that the response of each station
to changes in phase at the other is delayed for a time equal to the link
delay $\tau$. Thus, from $t = 0$ to $t = \tau$, station 2 is completely undisturbed.
Meanwhile, from $t = 0$ to $t = 2\tau$, station 1 observes no change in the
frequency received from station 2 and therefore, its response is
exponential with time constant $1/\lambda$. Therefore, from $t = \tau$ to $t = 3\tau$
station 2 responds to the exponential response received from station 1,
and so on. The result (21) could in fact have been derived by tracing
out the response of the system in this manner.

Fig. 2 — Impulse response of a two-station system. (Exact, approximate, and
zero-delay curves for $\lambda\tau = 0.1$; phase at the $n$th station versus time $t$,
in units of $1/\lambda$.)

A second approach, inherently inexact but more useful for times
that are long compared to the transmission delay, is to complete the
partial-fraction expansion of (18). This requires in principle
determination of the locations of all the poles, which are determined by the
transcendental equation

$$s + \lambda = \pm\lambda e^{-s\tau}, \tag{22}$$

where the "plus" sign refers to $Q_1(s)$ and the "minus" sign to $Q_2(s)$.
This equation has, in general, an infinity of solutions. However, if
$\lambda\tau$ is a small number, the most important components will be those
due to poles of the order of magnitude of $\lambda$. The exponent in (22)
is then small, so that the exponential can be approximated as

$$e^{-s\tau} \approx 1 - s\tau. \tag{23}$$

Using this approximation in (18) gives a form which is easily inverted
to give, finally, from (17),

$$\begin{aligned}
p_1(t) &\approx \frac{1}{2}\left[\frac{1}{1 + \lambda\tau} + \frac{1}{1 - \lambda\tau} \exp\left(-\frac{2\lambda t}{1 - \lambda\tau}\right)\right], \\
p_2(t) &\approx \frac{1}{2}\left[\frac{1}{1 + \lambda\tau} - \frac{1}{1 - \lambda\tau} \exp\left(-\frac{2\lambda t}{1 - \lambda\tau}\right)\right].
\end{aligned} \tag{24}$$

This obviously has an error at $t = 0$, which is small if $\lambda\tau$ is small, but
has the correct final value determined by (14). This result is compared
with the exact response, as well as with the response of a zero-delay
system, in Fig. 2, which illustrates the case of $\lambda\tau = 0.1$. The
approximation is better for smaller $\lambda\tau$.
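
The two approaches can be compared numerically. The sketch below evaluates the delayed-term series of (21) and the small-delay approximation of (24) for equal gains and equal delays; the function names are hypothetical, and the series routine assumes $t \ge 0$, $\tau > 0$, and a moderate number of terms (a small ratio $t/\tau$), since each term involves a factorial.

```python
import math


def exact_two_station(t, lam, tau):
    """Equation (21): sum the delayed terms that have started by time t."""
    p1, k = 0.0, 0
    while t - 2 * k * tau >= 0:
        u = t - 2 * k * tau
        p1 += (lam * u) ** (2 * k) * math.exp(-lam * u) / math.factorial(2 * k)
        k += 1
    p2, k = 0.0, 0
    while t - (2 * k + 1) * tau >= 0:
        u = t - (2 * k + 1) * tau
        p2 += ((lam * u) ** (2 * k + 1) * math.exp(-lam * u)
               / math.factorial(2 * k + 1))
        k += 1
    return p1, p2


def approx_two_station(t, lam, tau):
    """Equation (24): approximation from exp(-s*tau) ~ 1 - s*tau."""
    steady = 1.0 / (1.0 + lam * tau)
    decay = math.exp(-2.0 * lam * t / (1.0 - lam * tau)) / (1.0 - lam * tau)
    return 0.5 * (steady + decay), 0.5 * (steady - decay)


# lambda*tau = 0.1, evaluated at t = 1.5/lambda
print(exact_two_station(1.5, 1.0, 0.1), approx_two_station(1.5, 1.0, 0.1))
```

For $t < \tau$ the series leaves station 2 undisturbed and gives station 1 a pure exponential decay, in line with the delayed-response interpretation given for (21).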


V. LARGE FULLY INTERCONNECTED SYSTEMS 


Next to be considered is a network of $N$ identical stations in which
all stations transmit via identical direct links to all others. All
coefficients $a_{nm}$ are assumed equal:

$$a_{nm} = \frac{1}{N - 1}, \qquad n = 1, \cdots, N, \quad m \neq n. \tag{25}$$

If an impulse disturbance occurs at the first station, all the other
stations display identical responses, so that

$$p_2(t) = p_3(t) = \cdots = p_N(t). \tag{26}$$

The system response can, therefore, be described in terms of two
equations in $P_1(s)$ and $P_2(s)$:

$$\begin{aligned}
sP_1(s) &= 1 + H(s) e^{-s\tau} P_2(s) - H(s) P_1(s), \\
sP_2(s) &= \frac{H(s)}{N - 1}\, e^{-s\tau} [P_1(s) + (N - 2) P_2(s)] - H(s) P_2(s).
\end{aligned} \tag{27}$$

These equations can be formally solved to give

$$\begin{aligned}
P_1(s) &= \frac{s + H(s) - \dfrac{N - 2}{N - 1} H(s) e^{-s\tau}}{\Delta}, \\
P_2(s) &= \frac{H(s) e^{-s\tau}}{(N - 1)\Delta},
\end{aligned} \tag{28}$$

where

$$\Delta = s^2 + sH(s)\left[2 - \frac{N - 2}{N - 1}\, e^{-s\tau}\right] + H^2(s)\left[1 - \frac{N - 2}{N - 1}\, e^{-s\tau} - \frac{1}{N - 1}\, e^{-2s\tau}\right]. \tag{29}$$

The final value theorem gives

$$p_1(\infty) = p_2(\infty) = \frac{1}{N(1 + \lambda\tau)}. \tag{30}$$


5.1 Case 1: $H_n(s) = \lambda$, $\tau_{nm} = 0$.


In the special case of flat filters and no delays, inversion of the
Laplace transforms gives

$$\begin{aligned}
p_1(t) &= \frac{1}{N} + \left(1 - \frac{1}{N}\right) e^{-N\lambda t/(N-1)}, \\
p_2(t) &= \frac{1}{N}\,[1 - e^{-N\lambda t/(N-1)}].
\end{aligned} \tag{31}$$

In this case, the response is an exponential approach to equilibrium 
as in the 2-station system. When one station is disturbed, the other 
stations respond in unison, as one. 
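
A short numerical check of the zero-delay result (31) is sketched below; the function name is hypothetical. It returns the phase of the disturbed station and the common phase of all the others, and makes it easy to confirm that the response starts at (1, 0) and that both phases approach $1/N$.

```python
import math


def fully_connected_response(t, n_stations, lam):
    """Equation (31): (p_1, p_2) for a fully interconnected zero-delay system."""
    rate = n_stations * lam / (n_stations - 1)   # single decay rate N*lam/(N-1)
    decay = math.exp(-rate * t)
    p1 = 1.0 / n_stations + (1.0 - 1.0 / n_stations) * decay
    p_other = (1.0 - decay) / n_stations
    return p1, p_other


# Six stations, unit gain: one disturbed phase plus five equal phases sum to 1.
p1, p_other = fully_connected_response(0.7, 6, 1.0)
print(p1 + 5 * p_other)
```

The sum $p_1 + (N - 1)p_2$ stays equal to 1 at all times, which reflects the fact that with equal gains and symmetric coefficients the averaging only redistributes the initial phase error.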


5.2 Case 2: $H_n(s) = \lambda$, $\tau_{nm} = \tau$


In the case of equal positive delays and flat filters, the solution (28)
in transformed form can be partially expanded in partial fractions
to give

$$\begin{aligned}
P_1(s) &= \frac{1}{N}\,[Q_1(s) + (N - 1) Q_2(s)], \\
P_2(s) &= \frac{1}{N}\,[Q_1(s) - Q_2(s)],
\end{aligned} \tag{32}$$

where

$$\begin{aligned}
Q_1(s) &= \frac{1}{s + \lambda - \lambda e^{-s\tau}}, \\
Q_2(s) &= \frac{1}{s + \lambda + \dfrac{\lambda}{N - 1}\, e^{-s\tau}}.
\end{aligned} \tag{33}$$

This is similar in form to (17) and (18), and the same methods can
be used to evaluate the transient response. The principal difference
between this and the 2-station case is that the conditions for
cancellation of odd or even terms in the series of delayed responses (21) do not
hold in the many-station case, and the antisymmetric component, $q_2(t)$,
is more rapidly damped than the symmetric component $q_1(t)$. The results
for the zero-delay case and for the case of $\lambda\tau = 0.1$ are shown for a
6-station system in Fig. 3.

Fig. 3 — Impulse response of a fully interconnected 6-station system. (Phase at
the $n$th station versus time $t$, in units of $1/\lambda$.)

The simplicity of both the analysis and the result can be attributed
to the condition that all stations and all paths are identical. Although
the effects of slight departures from this condition may be of practical
interest, the slightest departure will destroy the symmetry and vastly
complicate the analysis. As a guess, it may be supposed that the effect
of a slight dissimilarity among paths will be smaller than the effect
of removing some of the paths. When all but $N$ paths have been
removed, in such a way that the system forms a ring in which each station
receives only from its two nearest neighbors, a new form of symmetry
appears, which will be used in the next section.


VI. THE BILATERAL RING 


A bilateral ring is defined as a ring of N identical stations, with 
2N identical one-way links forming N two-way links by which each 
station sends to, and receives from, its two nearest neighbors, one on 
each side. This may be viewed as the opposite extreme to the fully 
interconnected system, providing the longest possible indirect paths 
in a system of N identical stations. (Longer paths are possible in a 
chain, but the stations cannot be identical because each end station 
has only one neighbor.) 

The equations of the bilateral ring, in transform form, are

$$sP_n(s) = V_n(s) + H(s)\{\tfrac{1}{2}[P_{n+1}(s) + P_{n-1}(s)] e^{-s\tau} - P_n(s)\}, \qquad n = 1, 2, \cdots, N, \tag{34}$$

where addition in the index $n$ is performed modulo $N$, so that $P_{N+1}(s)$
is $P_1(s)$, $P_0(s)$ is $P_N(s)$, and the $(N - m)$th station can be alternatively
designated as the $(-m)$th. This system of equations will be simplified
by a form of Fourier analysis. We define

$$Q_k(s) = \sum_{n=1}^{N} P_n(s) e^{-j2\pi nk/N}, \qquad k = 1, \cdots, N, \tag{35}$$

where $j$ is the imaginary unit. It can then be shown by direct
substitution that

$$P_n(s) = \frac{1}{N} \sum_{k=1}^{N} Q_k(s) e^{j2\pi nk/N}, \qquad n = 1, \cdots, N. \tag{36}$$

Similarly, variables $U_k(s)$ will be defined by transformation of the
$V_n(s)$ as in (35), with inversion as in (36). The linearity of the Laplace
transformation implies similar relations among the variables in the
time domain. All these relations remain unaffected if any $n$ or $k$ is
changed by adding or subtracting $N$.
Let the $n$th equation in (34) be multiplied by $e^{-j2\pi nk/N}$, and the
equation summed over all $n$. The result is

$$sQ_k(s) = U_k(s) + H(s)\{\tfrac{1}{2}[e^{j2\pi k/N} + e^{-j2\pi k/N}] e^{-s\tau} - 1\} Q_k(s), \qquad k = 1, \cdots, N. \tag{37}$$

This can be solved immediately to give

$$Q_k(s) = \frac{U_k(s)}{s + H(s)[1 - e^{-s\tau} \cos(2\pi k/N)]}. \tag{38}$$

Given a set of transient frequency disturbances $v_n(t)$, one may find
their Laplace transforms $V_n(s)$, find the $U_k(s)$ using (35), find the
$Q_k(s)$ using (38), use (36) to obtain $P_n(s)$, and find the phase disturbances
$p_n(t)$ by inverse transformation.

In the case of an isolated impulse in frequency at the $N$th station,
we have

$$V_n(s) = 0, \quad n = 1, 2, \cdots, N - 1; \qquad V_N(s) = 1. \tag{39}$$

By using (35) we get

$$U_k(s) = 1, \qquad k = 1, \cdots, N. \tag{40}$$

Explicit solutions will be obtained here only for cases in which the
filters are flat. Under these conditions,

$$Q_k(s) = \frac{1}{s + \lambda[1 - e^{-s\tau} \cos(2\pi k/N)]}. \tag{41}$$

The complexity of the result depends on whether the delay $\tau$ is assumed
zero or positive.


6.1 Case 1: $H_n(s) = \lambda$, $\tau_{nm} = 0$, $N < \infty$.

If the delay is zero, (41) can be inverted immediately to give

$$q_k(t) = \exp\{-\lambda[1 - \cos(2\pi k/N)]t\} \tag{42}$$

and the phase disturbances, using (36), are

$$p_n(t) = \frac{1}{N} \sum_{k=1}^{N} e^{j2\pi nk/N} \exp\{-\lambda[1 - \cos(2\pi k/N)]t\}. \tag{43}$$

The $N$th term in this sum is real, as is the $(N/2)$th term if $N$ is even.
For all other $k$, the $k$th term is the complex conjugate of the $(N - k)$th
term, so that the sum is real, and may be expressed as the sum of
the real parts of the individual terms:

$$p_n(t) = \frac{1}{N} \sum_{k=1}^{N} \cos(2\pi nk/N) \exp\{-\lambda[1 - \cos(2\pi k/N)]t\}. \tag{44}$$

The $N$th term in this sum is a constant term, which applies equally
to all stations and does not affect the phase differences between
stations. All other terms are real exponentials approaching zero with
increasing time. The dashed curves in Figs. 4, 5, and 6 show the response
of a 6-station ring calculated from (44).
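
Equation (44) is simple to evaluate directly. The following sketch (function name hypothetical) computes the phase at station $n$ of a zero-delay bilateral ring after an impulse at station $N$; stations are indexed modulo $N$, so $n = N$ and $n = 0$ denote the disturbed station.

```python
import math


def bilateral_ring_response(n, t, n_stations, lam):
    """Equation (44): phase at station n, impulse applied at station N."""
    total = 0.0
    for k in range(1, n_stations + 1):
        angle = 2.0 * math.pi * k / n_stations
        total += (math.cos(n * angle)
                  * math.exp(-lam * (1.0 - math.cos(angle)) * t))
    return total / n_stations


# Response of a 6-station ring at t = 0.8/lambda, stations 1 through 6.
print([round(bilateral_ring_response(n, 0.8, 6, 1.0), 4) for n in range(1, 7)])
```

At $t = 0$ the sum reduces to a unit phase at the disturbed station and zero elsewhere, and at every later time the phases of all stations still sum to 1, since only the constant $N$th term survives the summation over $n$.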


6.2 Case 2: $H_n(s) = \lambda$, $\tau_{nm} = 0$, $N = \infty$.

This result can be extended to rings of indefinitely large size in two
different ways, so as to specify the response either a given number
of stations away from the source of the disturbance, or a given fraction
of the circumference away from the source. For the first approach,
which gives an exact result for an infinite ring, let

$$\theta_k = 2\pi k/N \tag{45}$$

and let $N$ increase without limit (approach infinity). Then the limit
of (43) defines the integral

$$p_n(t) = \frac{e^{-\lambda t}}{2\pi} \int_0^{2\pi} e^{jn\theta} \exp(\lambda t \cos\theta)\, d\theta, \tag{46}$$

which is related to a known integral form* for the modified Bessel
function of the first kind, order $n$, and gives

$$p_n(t) = e^{-\lambda t} I_n(\lambda t) = p_{-n}(t), \qquad n = \cdots, -1, 0, 1, \cdots. \tag{47}$$

Fig. 4 — Impulse response of bilateral rings with zero delays. (Phase versus
time $t$, in units of $1/\lambda$.)

Curves calculated from this equation are shown as the solid curves
in Fig. 4.
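
The infinite-ring response (47) can be evaluated without special libraries by summing the power series of the modified Bessel function, $I_n(z) = \sum_m (z/2)^{2m+n} / [m!\,(m+n)!]$. The sketch below uses hypothetical function names, and the fixed number of series terms is an assumption that is adequate only for moderate arguments (roughly $\lambda t$ up to 20).

```python
import math


def bessel_i(n, z, terms=60):
    """Modified Bessel function I_n(z) from its power series (moderate z only)."""
    n = abs(n)                      # I_{-n}(z) = I_n(z) for integer order
    return sum((z / 2.0) ** (2 * m + n)
               / (math.factorial(m) * math.factorial(m + n))
               for m in range(terms))


def infinite_ring_response(n, t, lam):
    """Equation (47): phase n stations away from the source, infinite ring."""
    return math.exp(-lam * t) * bessel_i(n, lam * t)


# Phases at the source and its first three neighbors at t = 2/lambda.
print([round(infinite_ring_response(n, 2.0, 1.0), 4) for n in range(4)])
```

Summing the response over all stations gives 1 at any time, a consequence of the generating-function identity $\sum_n I_n(z) = e^z$, so the infinite ring redistributes the initial unit of phase without changing its total.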

Full exploitation of this result requires that the station at which
the disturbance originates be called the zeroth, and that neighboring
stations be indexed with positive integers to one side and negative
integers to the other side. At any time $t$, the largest phase disturbance
is at the station at which the original disturbance occurred. The
asymptotic approximation for large $z$,

$$I_n(z) \sim \frac{e^z}{\sqrt{2\pi z}}, \tag{48}$$

shows that the phase disturbance decreases with increasing time
roughly as

$$p_n(t) \sim \frac{1}{\sqrt{2\pi\lambda t}}. \tag{49}$$

Although this result gives the wrong limit for a finite system, it gives
a clear picture of the early behavior while the response is still
substantially localized.

Fig. 5 — Large-$t$ approximation for a large bilateral ring. (Zero delays;
large-ring approximation compared with the exact 6-station ring; phase at
the $n$th station $p_n(t)$, in units of $1/N$, versus time $t$, in units of $N^2/\lambda$.)

Fig. 6 — Impulse response of a 6-station bilateral ring.


6.3 Case 3: $H_n(s) = \lambda$, $\tau_{nm} = 0$, $N$ large, $t$ large.

The alternative approach provides a better approximation for large $t$
after the disturbance has spread around the ring. When $t$ is large,
the dominant terms in (44) are those in which $\cos(2\pi k/N)$ is closest
to unity, including not only those in which $k$ is small but also those
in which $k$ is close to $N$, or, equivalently, $k$ is small and negative.
Since the $k$th term and the $(N - k)$th or $(-k)$th term are equal, the
latter terms can be effectively included by doubling each term for
small $k$. For large $N$, the approximation

$$\cos x \approx 1 - \frac{x^2}{2} \tag{50}$$

can be used for these terms. The $N$th or zeroth term is a constant $1/N$.
The other terms, which are small, can be omitted or included as
convenient; since it is difficult to specify in advance which terms are
negligible, it seems safest to include them all, at least formally. Thus,
approximately, for large $N$,

$$p_n(t) \approx \frac{1}{N} + \frac{2}{N} \sum_{k=1}^{\infty} \cos(2\pi nk/N) \exp\left(-\frac{2\pi^2 k^2}{N^2}\,\lambda t\right). \tag{51}$$

The time constants are proportional to the square of the number of
stations in the ring. The components have sinusoidal spatial
distributions around the ring, and the time constant is inversely proportional
to the square of the spatial frequency. Some curves calculated from
(51) are shown in Fig. 5, compared with the response of a 6-station ring.
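
The quality of the large-$t$ approximation can be checked against the exact finite-ring sum. In the sketch below, `ring_exact` evaluates (44) and `ring_large_t` evaluates (51) with the harmonic sum truncated after `terms` terms; both names and the truncation are assumptions made for illustration.

```python
import math


def ring_exact(n, t, n_stations, lam):
    """Equation (44): exact zero-delay response of a finite bilateral ring."""
    total = 0.0
    for k in range(1, n_stations + 1):
        angle = 2.0 * math.pi * k / n_stations
        total += (math.cos(n * angle)
                  * math.exp(-lam * (1.0 - math.cos(angle)) * t))
    return total / n_stations


def ring_large_t(n, t, n_stations, lam, terms=50):
    """Equation (51), truncated after `terms` spatial harmonics."""
    total = 0.0
    for k in range(1, terms + 1):
        rate = 2.0 * math.pi ** 2 * k * k * lam / n_stations ** 2
        total += math.cos(2.0 * math.pi * n * k / n_stations) * math.exp(-rate * t)
    return 1.0 / n_stations + 2.0 * total / n_stations


# 60-station ring at t = 0.1 * N^2 / lambda, at the source and halfway around.
for n in (0, 30):
    print(n, ring_exact(n, 360.0, 60, 1.0), ring_large_t(n, 360.0, 60, 1.0))
```

At times of this order only the first one or two harmonics contribute noticeably, which is why the slowest time constant, proportional to $N^2/\lambda$, governs how long the disturbance takes to even out around the ring.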


6.4 Case 4: $H_n(s) = \lambda$, $\tau_{nm} = \tau$, $N < \infty$.

If the delays are positive but all equal, the methods used in the
2-station system can be applied to the inversion of (41). The exact
result is

$$q_k(t) = \sum_{m=0}^{[t/\tau]} \frac{\lambda^m \cos^m(2\pi k/N)\,(t - m\tau)^m e^{-\lambda(t - m\tau)}}{m!}. \tag{52}$$

The approximation based on (23) gives

$$q_k(t) \approx \frac{1}{1 + \lambda\tau \cos(2\pi k/N)} \exp\left\{-\lambda t\, \frac{1 - \cos(2\pi k/N)}{1 + \lambda\tau \cos(2\pi k/N)}\right\}, \tag{53}$$

which may be compared with (42). Curves calculated from these
equations for a 6-station ring with $\lambda\tau = 0.1$ are shown in Fig. 6 and
compared with the zero-delay case.


VII. BILATERAL CHAINS 


It has been mentioned previously that a chain lacks the simplicity
of a ring because of the exceptional nature of the end stations. However,
given any chain of $N$ stations, an analogous ring can be formed by
duplicating all stations except the end stations so as to form a second
chain between the end stations as shown in Fig. 7, and taking the value
$\tfrac{1}{2}$ for each of the two averaging coefficients at each end station, leaving
all other parameters unchanged. The response of the chain to a
disturbance at any station can be found by applying the same disturbance
at the corresponding station or stations in the analogous ring; the
response of each half of the ring will be the same as the response of
the original chain.

A bilateral ring, as studied in the preceding section, will result if
the stations in the chain all have equal filter gains and if all averaging
coefficients (except at the end stations) equal $\tfrac{1}{2}$. Such a chain will be
called a bilateral chain. Thus, in particular, the response shown for
6-station rings in Figs. 4, 5, and 6 will also be observed in 4-station
bilateral chains disturbed by an impulse at an end station. The response
to a disturbance at any other station may be obtained by superposition
of two station responses calculated from the ring; the responses to be
superposed may be identified by supposing that the disturbance is
propagated in both directions from the source and is reflected at either
end of the chain.

Fig. 7 — A chain and its analogous ring.

Alternatively, in principle, the response of either the ring or the 
chain might be determined by superposition of an infinite number of 
terms of the infinite-ring response determined in the preceding section. 
The response of an infinite ring is the same as that of an infinite chain 
extending in both directions from the source of the disturbance, since 
the two networks are indistinguishable. The response of a finite ring 
could be calculated by supposing the disturbance to propagate around 
the ring an unlimited number of times in both directions. For a finite 
chain, the disturbance could be considered to spread in both directions 
(except when the disturbance originates at an end station) and to be 
reflected whenever it reaches an end station. This method may be 
useful in large chains or rings as a refinement of the simple approxima- 
tion of a large chain or ring as an infinite one. 


VIII. UNILATERAL RINGS AND CHAINS 


All the networks studied in previous sections have satisfied the
reciprocity condition, and in consequence all components of the response
have been nonoscillatory: strictly so in the zero-delay case, and
approximately in the case of small $\lambda\tau$. In this section, the opposite
extreme is studied. In the unilateral ring, the product of the averaging
coefficients in one direction is positive, while every coefficient in the
other direction is zero.


8.1 Rings


To define a unilateral ring, we assume a ring of $N$ identical stations
and assign a positive direction around the ring. Each station transmits
only to its nearest neighbor in the positive direction. Each station
then receives from only one other station and thus has only one averaging
coefficient equal to unity. All links are identical. The system equations
are

$$sP_n(s) = V_n(s) + H(s)[P_{n-1}(s) e^{-s\tau} - P_n(s)], \qquad n = 1, \cdots, N. \tag{54}$$

The transformation defined by (35) and (36) may be applied to this
network also. In place of (41), assuming the same impulsive disturbance
as given in (39), we get

$$Q_k(s) = \frac{1}{s + \lambda[1 - e^{-s\tau - j2\pi k/N}]}. \tag{55}$$

8.1.1 Case 1: $H_n(s) = \lambda$, $\tau_{nm} = 0$, $N < \infty$

Only the case of zero delays has been studied in detail. In this case,

$$q_k(t) = \exp\{-\lambda[1 - \cos(2\pi k/N)]t\} \exp[-j\lambda t \sin(2\pi k/N)]. \tag{56}$$

Hence,

$$p_n(t) = \frac{1}{N} \sum_{k=1}^{N} \exp\{j[2\pi nk/N - \lambda t \sin(2\pi k/N)]\} \exp\{-\lambda[1 - \cos(2\pi k/N)]t\}. \tag{57}$$

The sum is real and may be alternatively expressed as

$$p_n(t) = \frac{1}{N} \sum_{k=1}^{N} \cos[2\pi nk/N - \lambda t \sin(2\pi k/N)] \exp\{-\lambda[1 - \cos(2\pi k/N)]t\}. \tag{58}$$

The components are not real exponentials, but exponentially damped
sinusoids.
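
The damped-sinusoid sum (58) can be evaluated in the same way as the bilateral case. The sketch below uses a hypothetical function name and takes the disturbed station as $n = N$ (equivalently $n = 0$), with positive $n$ counted in the direction of transmission.

```python
import math


def unilateral_ring_finite(n, t, n_stations, lam):
    """Equation (58): phase at station n of a zero-delay unilateral ring."""
    total = 0.0
    for k in range(1, n_stations + 1):
        angle = 2.0 * math.pi * k / n_stations
        total += (math.cos(n * angle - lam * t * math.sin(angle))
                  * math.exp(-lam * (1.0 - math.cos(angle)) * t))
    return total / n_stations


# In a 64-station ring at t = 2/lambda, stations just ahead of the source
# carry the disturbance while stations just behind it are nearly untouched.
print(unilateral_ring_finite(3, 2.0, 64, 1.0),
      unilateral_ring_finite(-3, 2.0, 64, 1.0))
```

For a ring this large and a time this short the disturbance has not yet travelled all the way around, so the values agree closely with those of a ring of unlimited size.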


8.1.2 Case 2: H,(s) = , Tam = O, N = & 


In the infinite unilateral ring, using (45) in (57) and passing to the 
limit, 


—ré 27 
p(t) = <_ | e’”” exp (Ate’’) dé. (59) 
27 0 


— 76 7 


Expanding exp (Ate~’’) as a power series in Me™’ and integrating term 


by term gives 
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NE oP a 
p(t) = 292 | n=0,1,°-:- 


= 0, n= —1, -2,--- 


(60) 


This result is plotted in Fig. 8. The phase disturbances at adjacent 
stations are in the ratio 


\[
\frac{p_n(t)}{p_{n-1}(t)} = \frac{\lambda t}{n}, \tag{61}
\]

so that for fixed t and increasing n, p_n(t) increases until n = λt.
Therefore, at any time t the largest disturbance is at the mth station,
where

\[
\lambda t - 1 \le m \le \lambda t. \tag{62}
\]

Fig. 8—Impulse response of an infinite unilateral ring (zero delays):
phase at the nth station, p_n(t), versus time t in units of 1/λ.


For large t, the magnitude of the largest phase disturbance, obtained
with the aid of Stirling’s approximation for the factorial, is
asymptotically

\[
p_m(t) \sim \frac{1}{\sqrt{2\pi\lambda t}}. \tag{63}
\]

This is the same as the asymptotic value (49) obtained for the infinite
bilateral ring, except that in the unilateral case the peak precesses at
the rate of λ stations per unit time.
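
As a numerical illustration of (60) through (63), the following Python
sketch evaluates the impulse response in the Poisson form
p_n(t) = e^{−λt}(λt)^n/n!, locates the station carrying the largest
disturbance, and compares the peak with the Stirling asymptote
1/√(2πλt). The function names and the values of λ and t are illustrative
assumptions and do not come from the paper.

```python
import math

def impulse_response(n, lam, t):
    """p_n(t) = exp(-lam*t) * (lam*t)**n / n! for n >= 0, and 0 for n < 0."""
    if n < 0:
        return 0.0
    x = lam * t
    if x == 0.0:
        return 1.0 if n == 0 else 0.0
    # Evaluate in logarithms so that large n and t do not overflow.
    return math.exp(-x + n * math.log(x) - math.lgamma(n + 1))

def peak_station(lam, t, n_max=2000):
    """Index m of the station with the largest disturbance at time t."""
    return max(range(n_max), key=lambda n: impulse_response(n, lam, t))

lam, t = 1.0, 200.5                      # illustrative values only
m = peak_station(lam, t)
peak = impulse_response(m, lam, t)
asymptote = 1.0 / math.sqrt(2.0 * math.pi * lam * t)
print(m, peak, asymptote)
```

The peak index found this way should fall in the interval stated in
(62), and the ratio of the peak to the asymptote should approach unity
as λt grows.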


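8.1.3 Case 3: H_n(s) = λ, τ_nm = 0, N large, t large

Application of the approximation (50) together with

\[
\sin x \approx x \tag{64}
\]

gives, for large t in a large ring,

\[
p_n(t) \approx \frac{1}{N} + \frac{2}{N}\sum_{k=1}^{\infty} \cos[2\pi k(n - \lambda t)/N]\,\exp\left(-\frac{2\pi^2 k^2 \lambda t}{N^2}\right). \tag{65}
\]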


Compared with (51) this shows a response that resembles that of a 
bilateral ring except that it precesses around the ring in the positive 
direction at the rate of \ stations per unit time. The oscillatory nature 
of the response is associated with the progression of the disturbance 
around the ring. 


8.2 Chains 


A finite unilateral chain is a system with one master station and 
N — 1 slaves. If such a system is disturbed at one of the slave stations, 
the response of each station following it in the positive direction is 
the same as that of the corresponding station in an infinite unilateral 
ring. An impulse disturbance at the master station, however, does not 
correspond directly to any situation in a unilateral ring. A permanent 
phase shift of one unit occurs in the master station output. The effect
at the second station is the same as that of a step of magnitude λ in
the free-running frequency of the second station, and, a step being
the integral of an impulse, the response of the entire chain can be
inferred by integrating the response to an impulse at the second station.
Each station can thus be shown to approach its new equilibrium phase
monotonically. If the nominal time of response of each station is defined
as the time of maximum rate of change of phase (maximum frequency
shift), each station responds with a delay of 1/λ after the preceding
station. The effect of positive link delays in the unilateral chain is to
further delay the response without changing its form.
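
The integration described above can be sketched numerically for flat
filters and zero delays. Integrating λ times the impulse response
e^{−λt}(λt)^{k−1}/(k−1)! gives, for the kth slave, the phase
1 − e^{−λt} Σ_{i<k} (λt)^i/i!; this closed form is worked out here as an
assumption and is not quoted from the paper. The function names, the
grid search, and the value of λ are likewise illustrative.

```python
import math

def slave_phase(k, lam, t):
    """Phase of the k-th slave (k = 1 is adjacent to the master) at time t
    after a unit phase step at the master; flat filters, zero delays."""
    x = lam * t
    term, total = 1.0, 0.0
    for i in range(k):
        total += term              # accumulates x**i / i!
        term *= x / (i + 1)
    return 1.0 - math.exp(-x) * total

def slave_frequency_shift(k, lam, t):
    """Time derivative of slave_phase: lam * exp(-x) * x**(k-1) / (k-1)!."""
    x = lam * t
    return lam * math.exp(-x) * x ** (k - 1) / math.factorial(k - 1)

def time_of_peak_shift(k, lam, dt=1e-3, t_max=10.0):
    """Grid search for the time of maximum frequency shift."""
    steps = int(t_max / dt)
    return max((i * dt for i in range(steps + 1)),
               key=lambda t: slave_frequency_shift(k, lam, t))

lam = 2.0                          # illustrative gain
peaks = [time_of_peak_shift(k, lam) for k in range(1, 5)]
print(peaks)                       # successive peaks should be spaced by about 1/lam
```

Under these assumptions the peak times are (k − 1)/λ, so each station
lags its predecessor by 1/λ, and every phase rises monotonically toward
the new equilibrium value of one unit.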

The unilateral ring is not the analog of any chain in the sense of 
the preceding section. 


IX. RECTANGULAR ARRAYS 


A rectangular array, in which each station is connected to four 
nearest neighbors, can be considered as intermediate between a fully 
interconnected system and a chain or ring, and may be more appropriate 
than either as a model of a network of stations on the surface of the 
earth. A rectangular network with no edges or corners can be laid 
out on the surface of a toroid as in Fig. 9. This network can be analyzed 
by methods similar to those used for rings. 

Fig. 9—A toroidally-connected rectangular array.

The stations are most simply indexed with double subscripts
m = 1, ..., M_1, and n = 1, ..., M_2; the number of stations is
N = M_1 M_2. Assuming equal filters and equal delays, the system
equations are

\[
s P_{mn}(s) = V_{mn}(s) - H(s) P_{mn}(s) + \frac{H(s)\,e^{-s\tau}}{4}\left[P_{m,n-1}(s) + P_{m,n+1}(s) + P_{m-1,n}(s) + P_{m+1,n}(s)\right], \tag{66}
\]

assuming addition modulo M_1 in the first index, and modulo M_2 in
the second. Defining


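\[
q_{kl}(t) = \sum_{m=1}^{M_1}\sum_{n=1}^{M_2} p_{mn}(t)\,\exp\left[-j2\pi\left(\frac{km}{M_1} + \frac{ln}{M_2}\right)\right], \qquad k = 1, \ldots, M_1, \quad l = 1, \ldots, M_2, \tag{67}
\]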


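and proceeding as with the bilateral ring, we obtain, in the case of flat
filters and zero delays, with an impulse disturbance at the (M_1, M_2)th
(or zero-zeroth) station,

\[
p_{mn}(t) = \frac{1}{N}\sum_{k=1}^{M_1}\sum_{l=1}^{M_2} \exp\left[j2\pi\left(\frac{km}{M_1} + \frac{ln}{M_2}\right)\right] \exp\left[-\lambda t\left(1 - \tfrac{1}{2}\cos\frac{2\pi k}{M_1} - \tfrac{1}{2}\cos\frac{2\pi l}{M_2}\right)\right]. \tag{68}
\]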


Comparison with the bilateral ring is most convenient in the limiting 
cases of large systems. For the infinite array, 


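\[
p_{mn}(t) = e^{-\lambda t}\,I_m(\lambda t/2)\,I_n(\lambda t/2), \tag{69}
\]

which has the asymptotic form

\[
p_{mn}(t) \sim \frac{1}{\pi\lambda t}, \tag{70}
\]

indicating a more rapid approach to the final value in the rectangular
array than in the ring. The approximation (50) for large t in large
arrays is

\[
p_{mn}(t) \approx \frac{1}{N}\sum_{k=-\infty}^{\infty}\sum_{l=-\infty}^{\infty} \cos\left[2\pi\left(\frac{km}{M_1} + \frac{ln}{M_2}\right)\right] \exp\left[-\pi^2\lambda t\left(\frac{k^2}{M_1^2} + \frac{l^2}{M_2^2}\right)\right]. \tag{71}
\]
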


The longest time constant is shorter for a rectangular array than 
for a ring with the same number of stations. Fig. 10 shows some curves 
calculated from (69). 
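
The faster decay in the array can be checked numerically. The sketch
below evaluates the infinite-array response in the form
e^{−λt} I_m(λt/2) I_n(λt/2) and compares it with the asymptote 1/(πλt).
For the ring it uses e^{−λt} I_n(λt), which is not quoted in this section
and is taken here as an assumption consistent with the asymptote
1/√(2πλt). The scaled Bessel function is computed from its standard
integral representation; all function names and parameter values are
illustrative.

```python
import math

def scaled_bessel_i(n, x, steps=4000):
    """exp(-x) * I_n(x), from (1/pi) * integral over [0, pi] of
    exp(x * (cos u - 1)) * cos(n u) du, using the midpoint rule."""
    h = math.pi / steps
    total = 0.0
    for i in range(steps):
        u = (i + 0.5) * h
        total += math.exp(x * (math.cos(u) - 1.0)) * math.cos(n * u)
    return total * h / math.pi

def ring_response(n, lam, t):
    """Assumed infinite bilateral ring response exp(-lam t) I_n(lam t)."""
    return scaled_bessel_i(n, lam * t)

def array_response(m, n, lam, t):
    """Infinite rectangular array response exp(-lam t) I_m(lam t/2) I_n(lam t/2)."""
    x = 0.5 * lam * t
    return scaled_bessel_i(m, x) * scaled_bessel_i(n, x)

lam, t = 1.0, 200.0                # illustrative values only
a = array_response(0, 0, lam, t)
r = ring_response(0, lam, t)
print(a, 1.0 / (math.pi * lam * t))
print(r, 1.0 / math.sqrt(2.0 * math.pi * lam * t))
```

At the disturbed station the array response falls off as 1/t while the
ring response falls off as 1/√t, so for large λt the array value is the
smaller of the two.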

Fig. 10—Impulse response of an infinite rectangular array.

A bounded rectangular array in a plane is more complicated than
a toroidally connected array because of the exceptional edge and corner
stations. However, a bounded M_1 by M_2 array can be analyzed in
terms of an analogous 2M_1 − 2 by 2M_2 − 2 toroidal array as shown
in Fig. 11. All columns except the first and last are duplicated and
connected as shown by the solid lines to form a cylindrical array, and
then all rows except the first and last are duplicated and connected
as shown by the dashed lines; averaging coefficients are divided by
two whenever a station receives from duplicate stations. The toroidal
array has one station corresponding to each original corner station,
two for each edge station, and four for each interior station. The
response of the original bounded array to a disturbance at any station
is identical with the response of the corresponding part of the toroid
when the original disturbance is applied to corresponding stations.

Fig. 11—The toroidally-connected array analogous to a bounded rectangular
array.


Alternatively, in principle, the response of a finite toroidal array 
can be determined from that of the infinite array by considering the 
response to propagate around a toroidal array in the four cardinal 
directions, or to be reflected from the sides of a bounded array. 
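
The station count of the analogous toroid can be checked with a short
sketch. It assumes 1-based indices and M_1, M_2 ≥ 2; the function name
and the array size are hypothetical.

```python
def toroid_copies(m, n, M1, M2):
    """Number of toroidal stations corresponding to station (m, n) of a
    bounded M1-by-M2 array (1-based indices, M1 >= 2 and M2 >= 2)."""
    row_copies = 1 if m in (1, M1) else 2   # first and last rows are not duplicated
    col_copies = 1 if n in (1, M2) else 2   # first and last columns are not duplicated
    return row_copies * col_copies

M1, M2 = 4, 5                               # illustrative array size
total = sum(toroid_copies(m, n, M1, M2)
            for m in range(1, M1 + 1) for n in range(1, M2 + 1))
print(total, (2 * M1 - 2) * (2 * M2 - 2))
```

Corner stations map to one toroidal station, edge stations to two, and
interior stations to four, so the total equals (2M_1 − 2)(2M_2 − 2).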


X. RESPONSE TO SINUSOIDAL DISTURBANCES 


The steady-state response of a linear system to a sinusoidal dis- 
turbance is sinusoidal, and the phase difference between the response 
and the input disturbance, together with the ratio of the amplitudes, 
is given by the frequency response function as a function of frequency. 
The impulse response is equivalent in principle to the frequency response 
function as a specification of dynamic properties, since either can be 
expressed in terms of the other through Fourier or Laplace
transformation. The frequency response functions of a bilateral ring, in
particular, are the functions P_n(jω), which are the P_n(s) evaluated
along the “real frequency axis” s = jω, for real ω.

The frequency response will be determined in this section for infinite 
rings, both bilateral and unilateral, in the case of arbitrary equal 
filters and arbitrary equal delays. Although the expressions are more
complicated than the impulse response expression in the case of flat 
filters and zero delays, they do not become very much more complicated 
in the more general case, for which closed form expressions for the 
impulse response have not been obtained. 


10.1 Case 1: Bilateral Ring, N = ∞


For the bilateral ring, an expression for Q_k(s) is obtained from (38)
using (40), and P_n(s) is obtained using (36). Using the substitution
(45) and passing to the limit of infinite N gives

\[
P_n(s) = \frac{1}{2\pi}\int_0^{2\pi} \frac{e^{jn\theta}\,d\theta}{s + H(s)\left(1 - e^{-s\tau}\cos\theta\right)}. \tag{72}
\]

To evaluate this by contour integration, let

\[
z = e^{j\theta}. \tag{73}
\]

Then

\[
P_n(s) = \frac{1}{2\pi j}\oint \frac{z^{n-1}\,dz}{s + H(s)\left[1 - \dfrac{e^{-s\tau}}{2}\left(z + \dfrac{1}{z}\right)\right]}, \tag{74}
\]



integrated counterclockwise around the unit circle centered at the 
origin in the z-plane. 

When n ≥ 0 the integrand has two poles in the z-plane, located
at the roots of the quadratic equation

\[
z^2 - 2e^{s\tau}\,\frac{s + H(s)}{H(s)}\,z + 1 = 0. \tag{75}
\]

Since the denominator of the integrand is symmetric in z and 1/z,
one root is the reciprocal of the other. We defer consideration of the
case where both roots have unit magnitude; then one pole will lie
inside the path of integration and the other outside. Denote the root
inside the contour by

\[
z_1 = e^{s\tau}\,\frac{s + H(s)}{H(s)} - \sqrt{e^{2s\tau}\left[\frac{s + H(s)}{H(s)}\right]^2 - 1}, \tag{76}
\]

where it is understood that the square root is to be taken to have
whichever sign gives z_1 the smaller magnitude.
For convenience, let

\[
\beta(s) = \frac{H(s)}{s + H(s)}, \tag{77}
\]

this incidentally being the quantity whose magnitude is required to
be less than unity in the sufficient condition for stability given by
Gersho and Karafin.* Then

\[
z_1 = \frac{1 - \sqrt{1 - \beta^2(s)\,e^{-2s\tau}}}{\beta(s)\,e^{-s\tau}} = \frac{\beta(s)\,e^{-s\tau}}{1 + \sqrt{1 - \beta^2(s)\,e^{-2s\tau}}}, \tag{78}
\]

where the second form can be obtained by rationalizing the numerator.
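The integral around the contour is 2πj times the residue at this pole,
so that P_n(s) is equal to the residue, which can be put in the
alternative forms

\[
P_n(s) = \frac{\left[1 - \sqrt{1 - \beta^2(s)\,e^{-2s\tau}}\right]^n}{H(s)\,\beta^{n-1}(s)\,e^{-ns\tau}\sqrt{1 - \beta^2(s)\,e^{-2s\tau}}} = \frac{\beta^{n+1}(s)\,e^{-ns\tau}}{H(s)\sqrt{1 - \beta^2(s)\,e^{-2s\tau}}\left[1 + \sqrt{1 - \beta^2(s)\,e^{-2s\tau}}\right]^n}. \tag{79}
\]

For negative n, (74) can be transformed into an integral in y = 1/z
to show that

\[
P_n(s) = P_{-n}(s). \tag{80}
\]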


The deferred case in which z_1 has unit magnitude will now be briefly
considered. In this case, the quadratic equation (75) has two conjugate
roots of unit magnitude, or double roots at 1 or −1, and it is easily
shown that this occurs when β(s)e^{−sτ} is real and has magnitude 1 or
greater. If the sufficient condition for stability mentioned earlier is
satisfied, this cannot occur in the right half s-plane or on the real
frequency axis except at zero frequency, where a singularity is expected
to occur in any system configuration.

Where β(s)e^{−sτ} is ±1, P_n(s) is infinite and will ordinarily have a
branch point. This always occurs at s = 0, and occurs for other values
depending on the filters and delays. Where β(s)e^{−sτ} is real and has
magnitude greater than unity, P_n(s) will be finite but will have a step
discontinuity, because as s passes through a value at which β(s)e^{−sτ}
is real, z_1 crosses the unit circle and must immediately be redefined
as z_2, and the square roots in (79) abruptly change sign. The function
P_n(s) is thus defined as a single-valued function in the s-plane with
line discontinuities where it might be expected to have branch cuts.
If the system is stable, these discontinuities are confined to the interior
of the left half s-plane except for s = 0.

Thus, (79) defines P_n(jω) as a continuous single-valued function
except at ω = 0. In the case of flat filters and zero delays, P_n(s) is
the Laplace transform of (47). Fig. 12 shows the magnitude of P_n(jω)
for this case and for the case of λτ = 0.1.


10.2 Case 2: Unilateral Ring, N = ∞


For an infinite unilateral ring, a similar procedure gives

\[
P_n(s) = \frac{1}{j2\pi}\oint \frac{z^n\,dz}{[s + H(s)]\,z - H(s)\,e^{-s\tau}}, \tag{81}
\]

to be integrated over the same path as (74). When n ≥ 0, the integrand
has a single pole at z = β(s)e^{−sτ}. If the magnitude of β(s)e^{−sτ} is
less than 1, the pole is inside the unit circle, and for nonnegative n

\[
P_n(s) = \frac{\beta^{n+1}(s)\,e^{-ns\tau}}{H(s)}, \qquad n = 0, 1, 2, \ldots, \tag{82}
\]

while for negative n under the same conditions, the substitution y = 1/z
puts all poles outside the unit circle and

\[
P_n(s) = 0, \qquad n = -1, -2, \ldots. \tag{83}
\]

Fig. 12—Frequency response of an infinite bilateral ring (curves for
zero delay and for λτ = 0.1).



Fig. 13 shows the magnitude of P_n(jω) graphically. As the magnitude
of β(s)e^{−sτ} becomes greater than 1, the pole crosses the unit circle
and there is a step discontinuity in P_n(s) for all n. However, the
sufficient condition for stability mentioned previously is both necessary
and sufficient, in the unilateral ring, for these discontinuities to be
confined to the left half-plane.

The finite value of P_n(0), where a singularity should occur, is
attributable to the fact that every station in the infinite unilateral ring
is a slave station, and no finite change at any given station can alter
the equilibrium frequency. The infinite unilateral ring is in this sense
a pathological limiting case of the unilateral chain in which the master
station recedes to infinity and becomes inaccessible.
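
The finite zero-frequency value can be seen by evaluating the
unilateral response numerically. The sketch below takes (82) in the
form P_n(s) = β^{n+1}(s) e^{−nsτ}/H(s), which is the residue of (81) at
z = β(s)e^{−sτ}, and specializes it to a flat filter H(s) = λ. The
function name and the parameter values are illustrative assumptions.

```python
import cmath

def unilateral_response(n, omega, lam, tau=0.0):
    """P_n(j*omega) for a flat filter H(s) = lam and equal link delays tau."""
    if n < 0:
        return 0j                       # no response at stations with negative index
    s = 1j * omega
    beta = lam / (s + lam)              # beta(s) = H(s) / (s + H(s)) with H(s) = lam
    return beta ** (n + 1) * cmath.exp(-n * s * tau) / lam

lam = 2.0                               # illustrative gain
for n in (0, 1, 2):
    row = [round(abs(unilateral_response(n, w, lam)), 4) for w in (0.0, 1.0, 2.0, 4.0)]
    print(n, row)
```

Under these assumptions |P_n(jω)| = λ^n/(λ² + ω²)^{(n+1)/2}, which equals
1/λ at ω = 0 for every n and does not depend on the delay τ.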


Fig. 13—Frequency response of an infinite unilateral ring (zero delay):
magnitude of P_n(jω), with the vertical scale in units of 1/λ.


XI. JITTER RESPONSE


Jitter denotes random variations in the phase of a signal. In a digital 
signal, jitter can occur as a result of pattern-induced retiming errors 
in regenerative repeaters.’ Jitter reducers’ can reduce the high-frequency 
components of jitter, but, because the jitter reducer output frequency 
is slaved to the input frequency, the low-frequency components cannot 
be reduced. 

In a mutually synchronized system, the low-frequency components 
of jitter will affect the observed phase differences used to control the 
clocks. Even if the variations in the received phase, after jitter reduc- 
tion, are not themselves objectionable, they might cause objectionable 
variations in clock phases through the cumulative effects of each clock 
acting on the next. To simplify the analysis, only the effects on the 
clock phases are considered; the immediate effects of jitter are omitted. 

It is assumed that the effect of jitter on the link from the mth station
to the nth is to add a random component μ_nm(t) to the phase p_m(t − τ_nm)
that would be received without jitter. This random component is
assumed to have the properties of white Gaussian noise and to be
independent on different links. Assuming that a jitter reducer can be
designed that will compensate the immediate effects of jitter, we
determine only the cumulative effect of jitter propagating through the
system as a result of its effect on the station clocks.

The autocorrelation function assumed for μ_nm(t) is

\[
E[\mu_{nm}^*(t)\,\mu_{nm}(t + x)] = K\,\delta(x). \tag{84}
\]

Here “E” stands for the “expectation” or mean value, the star denotes
complex conjugation (immaterial here since μ_nm is real), and δ(x) is the
Dirac delta function. K represents the noise power density, assumed
to be the same for every link in the systems to be considered.


11.1 Case 1: Phase-Locked Oscillator 


As a standard of comparison, consider the effect of this jitter on a
simple phase-locked loop of gain λ, in which an oscillator is controlled
by the signal received from an unperturbed source over a jittered link.
The equation for the output phase p(t) in this system is

\[
\dot{p}(t) = F_1 + \lambda\,[F_0 t + \mu(t) - p(t)], \tag{85}
\]

where F_1 is the free-running frequency of the controlled oscillator,
F_0 the frequency of the master source, and the link delay is assumed
zero. Since the system is linear, and we are interested only in the random
component of the output, we may set F_1 = F_0 = 0 without loss of
generality. Thus, the Laplace-transformed system equation becomes

\[
s P(s) = \lambda M(s) - \lambda P(s) \tag{86}
\]

with solution

\[
P(s) = \frac{\lambda}{s + \lambda}\,M(s). \tag{87}
\]

We obtain the mean-square value of p(t) from its autocorrelation
function φ(x) evaluated at x = 0. This is determined from the
power-density spectrum

\[
\Phi(\omega) = \frac{1}{2\pi}\int_{-\infty}^{\infty} \varphi(x)\,e^{-j\omega x}\,dx \tag{88}
\]

by means of the inverse transformation

\[
\varphi(x) = \int_{-\infty}^{\infty} \Phi(\omega)\,e^{j\omega x}\,d\omega. \tag{89}
\]

The power-density spectrum of the input μ(t), obtained from (84) and
an integral of the form of (88), is flat, equal to K/2π, for all ω. The
output power density is obtained by multiplying this by the square
of the magnitude of the frequency response, obtained from (87):

\[
\Phi(\omega) = \frac{\lambda^2 K}{2\pi(\lambda^2 + \omega^2)}. \tag{90}
\]


The inversion integral (89) is evaluated by means of a partial-fraction
expansion. The analytic continuation of (90) in the s-plane, s = jω,
has poles in both the right and left half-planes. Since (89) is a Fourier
(not Laplace) inversion, terms due to poles in the right half-plane will
be zero for positive x; thus, for positive x we need only consider the
left half-plane. We obtain

\[
\varphi(x) = \frac{\lambda K e^{-\lambda x}}{2} \tag{91}
\]

and, as a limit,

\[
\varphi(0) = \frac{\lambda K}{2} \tag{92}
\]

is the mean-square value of p(t). The rms phase error is of course the
square root of this.
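
The value λK/2 can be confirmed by integrating the output power density
directly. The sketch below assumes the spectrum
Φ(ω) = λ²K/[2π(λ² + ω²)] and uses the substitution ω = λ tan u to map the
infinite frequency range onto a finite interval; the function names and
the values of λ and K are illustrative.

```python
import math

def output_power_density(omega, lam, K):
    """Phi(omega) = lam^2 K / (2 pi (lam^2 + omega^2))."""
    return lam ** 2 * K / (2.0 * math.pi * (lam ** 2 + omega ** 2))

def mean_square_phase(lam, K, steps=2000):
    """Integral of Phi over all omega, using omega = lam * tan(u),
    d(omega) = lam / cos(u)**2 du, with the midpoint rule on (-pi/2, pi/2)."""
    h = math.pi / steps
    total = 0.0
    for i in range(steps):
        u = -0.5 * math.pi + (i + 0.5) * h
        omega = lam * math.tan(u)
        total += output_power_density(omega, lam, K) * lam / math.cos(u) ** 2
    return total * h

lam, K = 3.0, 0.5                   # illustrative values only
ms = mean_square_phase(lam, K)
print(ms, lam * K / 2.0)
```

With this substitution the integrand reduces to the constant λK/2π, so
the numerical result agrees with λK/2 to rounding error.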


11.2 Case 2: Bilateral Ring


We now consider a bilateral ring with flat filters and small delays.
Each station receives two inputs, each with jitter with the
autocorrelation function (84), and therefore power density K/2π. Each
input is multiplied by λ/2, to produce an error-signal component with
power density (λ/2)²K/2π, and when two independent components are added,
their power densities add to produce a total power density of
2(λ/2)²K/2π, or λ²K/4π. The effect of the two jitter components is thus
equal to the effect of a white noise component of power density λ²K/4π
added to the free-running frequency. We ignore the steady-state
components and consider this to be the only input at each station. Thus,
we assume

\[
E[v_n^*(t)\,v_n(t + x)] = \frac{\lambda^2 K}{2}\,\delta(x). \tag{93}
\]

The variables u_k(t) are derived from the v_n(t) as in (34); direct
evaluation gives

\[
E[u_k^*(t)\,u_l(t + x)] = \begin{cases} \dfrac{\lambda^2 K}{2N}\,\delta(x), & k = l, \\[4pt] 0, & k \ne l. \end{cases} \tag{94}
\]

This shows that the u_k(t) are uncorrelated, and therefore (since we
have assumed Gaussian distributions) independent, each with power
density λ²K/4πN. It follows, since each q_k(t) depends only on the
corresponding u_k(t) as in (38), that the q_k(t) are independent. Denoting
their autocorrelation functions by ψ_k(x), we obtain their power-density
spectra, using the frequency response given by (38), as

\[
\Psi_k(\omega) = \frac{\lambda^2 K/4\pi N}{[\lambda + j\omega - \lambda e^{-j\omega\tau}\cos\theta_k][\lambda - j\omega - \lambda e^{j\omega\tau}\cos\theta_k]}, \tag{95}
\]



where the substitution (45) is used as an abbreviation. 

When k = N, (95) indicates infinite power density at zero frequency.
The autocorrelation function ψ_N(x) is consequently infinite for all x,
and in particular the mean-square value of q_N(t) is infinite, so that
the mean-square value of each p_n(t) is infinite. This occurs because
the random variations that the jitter induces in the system frequency
cause the system phase to execute a random walk. However, since q_N(t)
contributes equally to every p_n(t), it does not affect the phase
differences between clocks, and all other q_k(t) have finite mean-square
values. It follows that while the phase of each clock tends to deviate
indefinitely far from that of an unperturbed clock of the same frequency,
the deviation between clocks in the same system tends to remain bounded.

We are primarily interested in the phase difference between the
clock at each station and the delayed signal received from an adjacent
station. The mean-square value of this phase difference will be denoted by

\[
\overline{p^2_{n,n\pm1}} = E\{[p_n(t) - p_{n\pm1}(t - \tau)]^2\}. \tag{96}
\]

We express the phases in terms of q_k(t) using (36), writing the square
of the real sum of these complex quantities as the product of the sum
by its conjugate so that expansion of the product gives terms of the
form of the left side of (84); thus,

\[
\overline{p^2_{n,n\pm1}} = 2\sum_{k=1}^{N-1}\{\psi_k(0) - \mathrm{Re}\,[e^{\mp j\theta_k}\,\psi_k(\tau)]\}. \tag{97}
\]

Therefore, (95) should be used in an integral of the form of (89)
to determine ψ_k(0) and ψ_k(τ). The analytic continuation of (95) in the
s-plane has, in the left half-plane, all the poles of (41), and in addition
the reflections of these poles in the right half-plane. We continue to
use (23) as an approximation when λτ is small. The result is

\[
\psi_k(x) \approx \frac{\lambda K \exp\left[-\lambda x\,\dfrac{1 - \cos\theta_k}{1 + \lambda\tau\cos\theta_k}\right]}{4N(1 - \cos\theta_k)(1 + \lambda\tau\cos\theta_k)}, \qquad x \ge 0. \tag{98}
\]

In particular, to the first order in λτ,

\[
\psi_k(0) \approx \frac{\lambda K(1 - \lambda\tau\cos\theta_k)}{4N(1 - \cos\theta_k)} \tag{99}
\]

and, again using the linear approximation to the exponential,

\[
\psi_k(\tau) \approx \frac{\lambda K(1 - \lambda\tau)}{4N(1 - \cos\theta_k)}. \tag{100}
\]

We now find, from (97), that

\[
\overline{p^2_{n,n\pm1}} \approx \left(\frac{N - 1}{N}\right)\frac{\lambda K}{2}. \tag{101}
\]

The mean-square phase discrepancy observed in received signals is
thus substantially independent of the size of the system and
substantially unaffected by small link delays. It is roughly equal to the
mean-square phase error, given by (92), that would be induced by
the jitter in a single link in a simple phase-locked oscillator with
control gain λ.
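
The cancellation of the delay terms can be checked numerically. The
sketch below assumes the first-order forms
ψ_k(0) ≈ λK(1 − λτ cos θ_k)/[4N(1 − cos θ_k)] and
ψ_k(τ) ≈ λK(1 − λτ)/[4N(1 − cos θ_k)], with θ_k = 2πk/N, and sums
2[ψ_k(0) − cos θ_k ψ_k(τ)] over k = 1, ..., N − 1, as in (97) when ψ_k is
real. The function names and parameter values are illustrative.

```python
import math

def psi_zero(k, N, lam, tau, K):
    """First-order form of psi_k(0), as assumed in the lead-in."""
    c = math.cos(2.0 * math.pi * k / N)
    return lam * K * (1.0 - lam * tau * c) / (4.0 * N * (1.0 - c))

def psi_tau(k, N, lam, tau, K):
    """First-order form of psi_k(tau), as assumed in the lead-in."""
    c = math.cos(2.0 * math.pi * k / N)
    return lam * K * (1.0 - lam * tau) / (4.0 * N * (1.0 - c))

def mean_square_discrepancy(N, lam, tau, K):
    """2 * sum over k = 1..N-1 of psi_k(0) - cos(theta_k) * psi_k(tau)."""
    total = 0.0
    for k in range(1, N):
        c = math.cos(2.0 * math.pi * k / N)
        total += psi_zero(k, N, lam, tau, K) - c * psi_tau(k, N, lam, tau, K)
    return 2.0 * total

N, lam, tau, K = 12, 1.0, 0.1, 2.0      # illustrative values only
msd = mean_square_discrepancy(N, lam, tau, K)
print(msd, (N - 1) / N * lam * K / 2.0)
```

Each term of the sum reduces to λK/4N, so the result is
(N − 1)λK/2N whatever the value of τ, in agreement with (101).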


11.3 Case 3: Unilateral Ring 


In a unilateral ring, each station receives only one input, so that
the equivalent v_n(t) has power density λ²K/2π and its autocorrelation
function has twice the value given in (93). The appropriate frequency
response is given by (55), so that instead of (95) we get

\[
\Psi_k(\omega) = \frac{\lambda^2 K/2\pi N}{[\lambda + j\omega - \lambda e^{-j(\omega\tau + \theta_k)}][\lambda - j\omega - \lambda e^{j(\omega\tau + \theta_k)}]}. \tag{102}
\]

We continue to use (23) to determine a simple approximation. The
result is

\[
\psi_k(x) \approx \frac{\lambda K \exp\left[-\lambda x\,\dfrac{1 - e^{-j\theta_k}}{1 + \lambda\tau e^{-j\theta_k}}\right]}{2N(1 - \lambda\tau)(1 - \cos\theta_k)}, \qquad x \ge 0. \tag{103}
\]

In particular,

\[
\psi_k(0) \approx \frac{\lambda K(1 + \lambda\tau)}{2N(1 - \cos\theta_k)} \tag{104}
\]

and

\[
\psi_k(\tau) \approx \frac{\lambda K\,[1 + \lambda\tau e^{-j\theta_k}]}{2N(1 - \cos\theta_k)}. \tag{105}
\]

Equation (97) is equally valid for the unilateral ring as for the bilateral
ring, giving

\[
\overline{p^2_{n,n-1}} \approx \lambda K\left(\frac{N - 1}{N}\right), \qquad
\overline{p^2_{n,n+1}} \approx \lambda K\left[\left(\frac{N - 1}{N}\right) + 2\lambda\tau\left(\frac{N - 2}{N}\right)\right]. \tag{106}
\]

The mean-square phase discrepancy is essentially twice that which
occurs in the bilateral ring. The link delay has a first-order effect
on the signal received at each station from the station to which it
transmits timing control because of the round-trip delay.





XII. SUMMARY AND CONCLUSIONS


In this section, I propose to extrapolate the specific results of the 
preceding sections to general conclusions that, although not strictly 
proven, seem quite likely to be true from a practical standpoint. 

It was shown in Section III that a system that satisfies the reciprocity 
condition and has flat filters and no delays will have a nonoscillatory 
transient response. The response was described more specifically in 
later sections for specific configurations: 2-station systems, fully inter- 
connected systems, and bilateral rings and chains, all of which met these 
conditions. These configurations appear to span the extremes of practical 
systems. 
The effect of delays was determined specifically only for these special
configurations and for the special case of equal delays and flat filters.
The effect was shown graphically for λτ = 0.1; it appeared to be small
and unobjectionable. Fig. 6 shows that at all stations, over the time 
range shown, the response to a transient disturbance is actually smaller 
when delays exist. At the zeroth (or Nth) station, where the disturbance 
originates, this can be attributed to the short period after the disturb- 
ance during which the neighboring stations remain undisturbed and 
are, therefore, reliable indicators of the original state. At other stations, 
the appearance of smaller disturbances is due in part to the delayed 
peak of the response. 

I propose to conjecture that the dynamic effect of delay in any 
reciprocal system with flat filters will be equally unobjectionable as 
long as the product of the largest filter gain and the largest single link 
delay is less than 0.1. This would be an unjustified extrapolation from
a purely mathematical standpoint, but it seems reasonable in the light 
of the physical interpretation suggested in the preceding paragraph. 

The effect of filters with other than flat frequency response has not 
been shown at all in terms of transient response. Two aspects of this 
question appear important. In the first place, it may be possible to 
obtain some improvement in transient response by appropriate filter 
design, but further analysis appears necessary to answer this question. 
In the second place, assuming that the flat filter gives a satisfactory 
response, the effect of high-frequency cutoff, which is inevitable in 
a practical system, must be estimated. A tentative answer to this 
question can be obtained by examination of the expressions for
frequency response P(s) developed for specific configurations. In all
these expressions, the system response is substantially the same as
in the flat-filter case as long as the filters H_n(s) remain substantially
flat until the frequency s becomes large compared with the
zero-frequency filter gains λ_n. This condition establishes an approximate
bandwidth requirement for the filters. The extrapolation to arbitrary 
configurations is proposed in this case also. 

The effect of departure from the reciprocity conditions is illustrated 
in only one case: the unilateral ring. Here, although the departure 
from reciprocity is the greatest possible, the effect on the transient 
response is mild. The magnitude of the response, and its rate of sub- 
sidence, are substantially unchanged; the principal effect is the preces- 
sion of the disturbance around the ring. The oscillatory components 
in the response can be associated with this precession. 

Extrapolation of this result appears uncertain. The reciprocity
condition can be stated in terms of the equality of the products of averaging
coefficients in opposite directions around any loop. It can easily be 
conjectured that if the product of the averaging coefficients around 
any loop is much larger in one direction than the other, there will 
be a tendency for disturbances to precess around the loop in this direc- 
tion, thereby generating oscillatory components in the response. On 
the other hand, it is hard to imagine pure precession in a multiloop 
network. A possible answer is suggested by the argument in Section III, 
in terms of pole loci, suggesting that a considerable departure from 
reciprocity could be tolerated before oscillatory components began 
to appear. 

This extrapolation is suggested only for the case of flat or nearly 
flat filters and zero or small delays. For other cases, departures from 
reciprocity may give rise to a stability problem. This is suggested 
by the analysis in Section X of the discontinuities in the frequency
response of an infinite unilateral ring, which showed that the stability 
condition that has been shown in the general case only to be a sufficient 
condition is in this case not merely sufficient but necessary. The latitude 
for filter shaping may be smaller in the nonreciprocal case, limited 
not simply by instability but by the deterioration of transient response 
that generally accompanies an approach to instability. 

The analysis of jitter response shows that in certain representative 
cases the effect of jitter does not accumulate in a large system. This 
gives a definite negative answer to the question of whether cumulative 
jitter necessarily occurs in a large system. It seems reasonable to 
conjecture that this conclusion is independent of configuration, and 
remains true for substantially flat filters and small delays, but less 
reasonable to suppose that it will remain true for arbitrary filters. 

Nothing in this study should be construed to indicate a preferred 
configuration for a practical system. Full or nearly full interconnections, 
nearest-neighbor connections, branching networks, or other forms may 
be appropriate. In particular, the apparent superiority of the fully 
interconnected network from the standpoint of transient response must 
be tempered by the practical considerations against setting up a large 
number of very long connections. 
APPENDIX


Reciprocal Systems in the Steady State 


The assumption of zero initial conditions, used in the study of
transient behavior, must now be dropped. Thus, the transformed equations
(2) are no longer valid, but the original equations (1) may be used. In
the steady state, the rate of change of phase at every station is equal to
the common system frequency f,

\[
\dot{p}_n(t) = f, \qquad n = 1, \ldots, N, \tag{107}
\]

so that

\[
p_n(t) = ft + \gamma_n, \qquad -\infty < t < \infty. \tag{108}
\]

Thus, in the steady state, the v_n(t) being constant, the system equations
(1) become

\[
f = v_n + \lambda_n \sum_{m=1}^{N} a_{nm}\,(\gamma_m - \gamma_n - f\tau_{nm}), \qquad n = 1, \ldots, N. \tag{109}
\]


m=1 


The general solution to these equations is the expression given by
Gersho and Karafin’ in terms of cofactors of a matrix derived from
the a_nm. In the reciprocal case, let the nth equation in (109) be
multiplied by C_n and the equations be summed over all n; when the
reciprocity condition in the form of (8) is applied, all the terms in the
phases γ_n drop out and one gets


N N N N 
f>>¢C, = Do Ce, — #5 Cp DS GamTam- (110) 


n=l m=] 


This can be solved immediately for f, the expression being similar
in form to the solution reported by Gersho and Karafin, except that
the C_n, which are easily determined by (9), replace the matrix cofactors.
Deformation of Gas Lenses by Gravity


By D. GLOGE
(Manuscript received September 14, 1966)


Gravity forces cause distortions in tubular gas lenses. A theory is derived
here which yields excellent quantitative agreement with measured distortions
for various tube lengths, diameters, and gases. It is shown that in a gas
lens of optimum design the displacement of the optical center has a maximum
at the end of the lens. The amount of displacement increases with the fourth
power of the tube diameter and with the square of the gas pressure.


I. INTRODUCTION 


If a cool gas is blown into a hot tube (Fig. 1), the gas heats up first 
at the wall of the tube and remains cool longer at its center. The density,
therefore, is higher in the center of the tube and decreases toward the
wall. The increase in density is accompanied by an increase in dielectric 
constant. In this way the gas acts as a positive lens.’’” 

At the same time, however, the cooler gas tends to sink down because 
of gravity, thus causing an asymmetric density profile in a horizontal 
tube.” Though a simple approach already gives an estimate of this 
effect," a more rigorous theory is derived here using a perturbation cal- 
culation which determines the transverse convection currents from the 
unperturbed temperature profile and then uses the currents to correct 
the temperature profile. 


II. TRANSVERSE CONVECTION CURRENTS


The tube walls are at a temperature T_w and ΔT degrees warmer than
the entering gas. Heat diffuses toward the axis and determines the
temperature field. Using the coordinate system shown in Fig. 1, the
temperature field may be approximated by”


t= Ty — at|1— ete, (Fae), (1) 
Fig. 1—Convection currents and temperature distribution in a gas lens
(cool gas in, warm gas out; tube wall at temperature T_w).


where a is the tube radius and s a decay length given by the formula 


2 
Q V0 ; 
ate 7 3a (2) 


v_z0 is the gas velocity along the axis and α the thermal diffusivity
defined as the ratio of heat conductivity κ to heat capacity:

\[
\alpha = \frac{\kappa}{\rho c_p}. \tag{3}
\]

The heat capacity is written here as the product of density ρ and specific
heat at constant pressure c_p.

The temperature is related to the density ρ and the pressure p by
the gas equation

\[
p = R\rho T. \tag{4}
\]


The density determines the gravitational forces gp which drive the gas 
particles in the transverse direction. The transverse components of the 
velocity field v(z, y, 2) can be found from Newton’s Law 


gp = grad p — vpV'v + p “ (5) 


where v is the kinematic viscosity determining the frictional forces. The 
acceleration is described by the total differential dv/dt and for the steady 
state takes the form 


d 
oF = (v grad)v. (6) 


In the problem under consideration the gas may be treated as a 
quasi-incompressible (Boussinesq) fluid. That means that variations 
of density may be neglected, except insofar as they modify the action 
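of gravity. Forming the curl of (5), therefore, yields 


    $\mathrm{curl}\,(\mathbf{g}\rho) = -\nu\rho\,\mathrm{curl}\,\nabla^2\mathbf{v} + \rho\,\mathrm{curl}\,\dfrac{d\mathbf{v}}{dt}$.    (7) 

Using (4) and rearranging (7) one finds 

    $\mathbf{g} \times \left( \dfrac{\mathrm{grad}\,T}{T} - \dfrac{\mathrm{grad}\,p}{p} \right) = \nu\,\mathrm{curl}\,\mathrm{curl}\,\mathrm{curl}\,\mathbf{v} + \mathrm{curl}\,\dfrac{d\mathbf{v}}{dt}$.    (8) 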


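Here $\mathrm{grad}\,p/p$ can be neglected compared with $\mathrm{grad}\,T/T$, and $T$ in 
the denominator will be replaced by the mean (absolute) temperature 
$\bar{T}$. Finally, by inserting (6) one finds 


    $\dfrac{1}{\bar{T}}\,(\mathbf{g} \times \mathrm{grad}\,T) = \nu\,\mathrm{curl}\,\mathrm{curl}\,\mathrm{curl}\,\mathbf{v} + \mathrm{curl}\,(\mathbf{v}\,\mathrm{grad})\,\mathbf{v}$.    (9) 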


To solve this equation, a tentative velocity distribution is introduced 
which represents the flow lines shown in Fig. 1. The unknown coefficients 
are chosen in such a way that the equation 


    $\mathrm{div}\,\mathbf{v} = 0$    (10) 


is fulfilled, which assumes that the gas is incompressible. Then the 
velocity components 
    $v_x = \dfrac{v_{x0}}{a^4}\,(a^2 - x^2 - y^2)(a^2 - x^2 - 5y^2)$ 

    $v_y = 4\,\dfrac{v_{x0}}{a^4}\,xy\,(a^2 - x^2 - y^2)$    (11) 

    $v_z = \dfrac{v_{z0}}{a^2}\,(a^2 - x^2 - y^2)$ 
result, which leaves only the coefficient $v_{x0}$ unknown, since the velocity 
$v_{z0}$ is determined by the forced laminar flow in the tube. $v_{x0}$ is the vertical 
gas velocity at the tube center caused by the gravitational forces. It 
may be assumed to be much smaller than the longitudinal velocity $v_{z0}$. 
Though $v_{x0}$ is a function of $z$, the variation of $\mathbf{v}$ in the $z$-direction is neg- 
ligible compared to its variation in the cross-sectional plane and has 
to be considered only in the acceleration term, where $\partial v_{x0}/\partial z$ occurs 
multiplied by the velocity $v_{z0}$. 

With these approximations, $v_{x0}$ can be determined by inserting (1) 
and (11) into (9) which yields 


AT 


4) 
49 


Vo 
02 





ey = 192 - y + 180., (12) 


Third- and higher-order products of x and y are neglected in this equa- 
tion since they are only important at the wall of the tube and contribute 
little at the tube center. 

Equation (12) is a linear inhomogeneous differential equation in z 
with the solution 


= g AT at 8 —2/s _ -2/@ 
ea aCe on) oe 
where 
    $q = \dfrac{3}{32}\,\dfrac{a^{2} v_{z0}}{\nu}$.    (14) 


A discussion of (13) is postponed in order to proceed with the cal- 
culation of the lens disturbance by using the derived convection cur- 
rents to correct the temperature profile which, in turn, gives the density 
distribution and the lens profile. 


III. DISPLACEMENT OF THE OPTICAL CENTER 


The gravitational forces cause a continuous flow of cool gas toward 
the bottom of the pipe, which distorts the temperature profile more and 
more in the way shown in Fig. 2. The growing temperature gradient at 
the bottom, however, will increase the heat diffusion toward the center 
and counteract the convection effect. The equality of both effects is ex- 
pressed by the equation 


    $\alpha \nabla^2 T = \mathbf{v}\,\mathrm{grad}\,T$    (15) 


which determines the actual temperature profile under the boundary 
condition that $T = T_w$ at the tube wall. 

Considering that the temperature function in the axial direction is 
much less curved than in the radial one, $\partial^2 T/\partial z^2$ may be neglected and 
(15) separated with respect to $z$. This yields 

    $\alpha \left( \dfrac{\partial^2 T}{\partial x^2} + \dfrac{\partial^2 T}{\partial y^2} \right) = v_x \dfrac{\partial T}{\partial x} + v_y \dfrac{\partial T}{\partial y} - v_z \dfrac{T - T_w}{s}$    (16) 

where $T - T_w$ is an exponential function of $z$ as already introduced by 
(1) for the undisturbed temperature profile. 

No straightforward solution of (16) is known. Assuming, however, 
that the gravity effect, to first order, tilts the temperature profile in the 
x-direction as shown in Fig. 2, the amount of this disturbance can be 
calculated. The assumption implies that by transforming $T(x, y, z)$ 
into new coordinates 


    $\xi = x - \delta\,(T_w - T); \quad \eta = y; \quad \zeta = z$    (17) 


the undisturbed profile can be regained, which in the following is denoted 
by $\theta(\xi, \eta, \zeta)$. Since this is symmetric with respect to $\xi$, the corresponding 
transformation in (16) must generate a differential equation for $\theta$ 
which contains only even terms in $\xi$. The requirement that the odd terms 


[Fig. 2 labels: $g$; $T_w$; $(T_w - \theta)_{\xi=0}$.] 





Fig. 2 — The temperature function $T(x)$ and its transformation into a sym- 
metric function $\theta(x - \delta T)$. 
cancel yields the following equation for $\delta$: 


af 1 _ , 98 @ —~ Ty a0 
2as( a v, ry: + 26v, R 3E 


The locus of the minimum of the temperature T(x) is of particular 
interest, for this is the optical center of the distorted lens profile. Fig. 
2 shows that this center occurs at a distance 


    $d = \delta\,(T_w - \theta)_{\xi=0}$    (19) 


below the tube axis. Using for $\theta$ the undisturbed temperature profile 
given in (1) and solving (18) for $\delta\,(T_w - \theta)$ at $\xi = 0$ yields 


(18) 


1 a ; (20) 
oe ee 
s a 


By inserting (2) and (8) into (20) one finally finds 


a: 1 gat AT gq —2/8 —2/q 
d = 7. e (aera mae (21) 
The diffusivity $\alpha$ and the viscosity $\nu$ for perfect gases are related by 
Eucken's formula 


    $\dfrac{\alpha}{\nu} = \dfrac{1}{4} \left( 9 - 5\,\dfrac{c_v}{c_p} \right)$,    (22) 


$c_v$ being the specific heat at constant volume. As Table I shows, the 
decay lengths $s$ and $q$ given by (2) and (14) differ very little. Since (21) 
is not defined for $s = q$ it is more convenient to use the following ap- 
proximation for (21): 


4 
d= my i (23) 


which is valid for $z < 2sq/|q - s|$. 
In Fig. 3 the displacement of the center of the lens profile is plotted 


TABLE I 

Gas     c_p/c_v    α/ν     q/s 
He      1.66       1.55    1.03 
N₂      1.41       1.35    0.93 
CO₂     1.31       1.30    0.89 
CH₄     1.31       1.30    0.89 
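
The entries of Table I can be checked numerically. The sketch below evaluates Eucken's formula (22) for the listed values of c_p/c_v and then forms q/s. It assumes that (2) and (14) read s = a² v_z0 / (7.3 α) and q = (3/32) a² v_z0 / ν; these two constants are a reading of damaged equations, not a confirmed transcription, and the function names are illustrative only.

```python
# Sketch: Eucken ratio alpha/nu and the decay-length ratio q/s for the
# gases of Table I. The constants 7.3 and 3/32 are ASSUMED readings of
# Eqs. (2) and (14); they are not confirmed by the source text.

def eucken_ratio(gamma):
    """Return alpha/nu = (9 - 5*c_v/c_p)/4 for gamma = c_p/c_v."""
    return 0.25 * (9.0 - 5.0 / gamma)


def decay_length_ratio(gamma, s_const=7.3, q_const=3.0 / 32.0):
    """Return q/s, assuming s = a^2 v/(s_const*alpha) and q = q_const*a^2 v/nu."""
    return q_const * s_const * eucken_ratio(gamma)


# c_p/c_v values as listed in Table I
GAMMA = {"He": 1.66, "N2": 1.41, "CO2": 1.31, "CH4": 1.31}

if __name__ == "__main__":
    for gas, gamma in GAMMA.items():
        print(f"{gas:4s} alpha/nu = {eucken_ratio(gamma):.2f}   "
              f"q/s = {decay_length_ratio(gamma):.2f}")
```

Under these assumptions the computed ratios fall close to the tabulated ones, which is consistent with the statement that s and q differ very little.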
[Fig. 3 axes: displacement of optical center in mm versus distance from input in mm.] 


Fig. 3 — Displacement of the profile center in a tubular gas lens of %-inch i.d. 
for flow rates of 0.5, 1, and 3 liters per minute using air ($\alpha$ = 0.22 cm²/s). Meas- 
ured data by DeGano. 


versus the distance from the tube input for flow rates of 0.5, 1, and 3 
liters per minute. A tube of 23-inch diameter and 100°C wall temperature 
is assumed. The gas enters at room temperature. The mean temperature 
during the process is assumed to be $\bar{T}$ = 50°C. The gas is air with a 
diffusivity $\alpha$ = 0.27 cm²/s. 
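
For orientation, the decay length s for these flow rates can be estimated as follows. This is a sketch under two stated assumptions: the flow is fully developed laminar (Poiseuille) flow, so that the axial velocity is twice the mean velocity, and (2) reads s = a² v_z0 / (7.3 α), where the constant 7.3 is a reading of a damaged equation. With these assumptions the tube radius cancels and s depends only on the volume flow and the diffusivity.

```python
import math

# Sketch: decay length s of the temperature field versus flow rate.
# ASSUMPTIONS (not confirmed by the source): Poiseuille flow, so the
# axis velocity is v_z0 = 2*Q/(pi*a^2), and s = a^2*v_z0/(7.3*alpha).

def decay_length_cm(flow_l_per_min, alpha_cm2_per_s, s_const=7.3):
    """Return s in cm; the tube radius cancels under the assumptions above."""
    flow_cm3_per_s = flow_l_per_min * 1000.0 / 60.0
    return 2.0 * flow_cm3_per_s / (s_const * math.pi * alpha_cm2_per_s)


if __name__ == "__main__":
    for flow in (0.5, 1.0, 3.0):
        print(f"{flow:3.1f} l/min -> s = {decay_length_cm(flow, 0.27):.1f} cm")
```

Under these assumptions the 3 liters per minute case gives a decay length close to the 16-cm lens length mentioned in the text.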

All curves show a linear increase of the displacement at the tube input, 
determined by the transverse acceleration of the gas. Further from the 
input the displacement follows the exponential decay of the temperature 
profile. The maximum displacement occurs at $z = s$. Measurements at 
the end of a 16-cm gas lens using the mentioned parameters are in fair 
agreement with the theory. 

In Fig. 4 the displacement is shown for a tube of 4-inch diameter 
and two different gases: CO₂ with $\alpha$ = 0.125 cm²/s and N₂ with $\alpha$ = 
0.25 cm²/s. The temperatures are the same as in Fig. 3. The flow rate 
is 1 liter per minute. In this case, data are available for various tube 
lengths.³ They show an excellent agreement with the predicted behavior 
of $d$ versus $z$. 

The focal length of the tubular gas lens has a minimum if the flow 
rate is chosen in such a way that s equals the tube length. The maximum 
displacement occurs at the end of such a lens and has the value 

1 ga’ AT 
max = 5040 2 (24) 

A more useful measure for the gravity effect is the distance D by 

which a light beam has to be displaced off the tube axis to pass the lens 
without deflection. Integrating at $x = D$ over the tube length $L$ one 
finds $D$ from the requirement that the total deflection cancels: 


    $\displaystyle\int_0^L \left. \dfrac{\partial T}{\partial x} \right|_{x=D} dz = 0$.    (25) 

The development of the (disturbed) temperature field $T$ about the axis 
yields for small distortion 


    $\displaystyle\int_0^L \left[ D - d(z) \right] e^{-z/s}\,dz = 0$;    (26) 


and finally, by using (23) one has 


Fi 1 = (22 + 1 )or¥ 
D= FF ee (27) 


In Fig. 5 the displacement $D$ is plotted versus the flow rate for CO₂ 
in a 7-inch tube assuming the same temperatures as in Figs. 3 and 4. 
Data measured by Steier³ show good agreement with the theory. For 
$L \gg s$ 
1 ga 

D 3000 a 28) 
is a good approximation. According to this formula, the optical center 
of a CO₂ lens of optimum design would occur outside the tube if the 
tube diameter is larger than 1 cm. 


[Fig. 4 axes: displacement of optical center in mm versus distance from input in mm.] 


Fig. 4 — Displacement of the profile center in a tubular gas lens of 14-inch i.d. 
for a flow rate of 1 liter per minute using CO₂ ($\alpha$ = 0.1 cm²/s) or N₂ ($\alpha$ = 0.2 
cm²/s). Measured data by Steier.³ 
[Fig. 5 axes: effective displacement D in mm versus flow in liters per minute.] 


Fig. 5 — Displacement of the optical center of a tubular gas lens using CO₂ 
and a tube 7 inches long and of 44-inch diameter. Measured data by Steier.³ 


IV. CONCLUSIONS 


The calculations show that the temperature distribution in a gas- 
filled tube undergoes a distortion which increases with the fourth power 
of the tube radius. A square law dependence on pressure is predicted for 
the range of 0.05 to 50 atmospheres where the thermal conductivity is 
independent of the pressure and, therefore, the diffusivity $\alpha \propto 1/p$. 

As a measure of the distortion, the displacement of the effective 
optical center in a tubular gas lens is calculated. Using CO₂ at room tem- 
perature and a tube of 10-mm diameter at 100°C wall temperature the 
optical center occurs at the bottom of the tube. 


REFERENCES 


1. Berreman, D. W., A Lens or Light Guide Using Convectively Distorted Ther- 
   mal Gradients in Gases, B.S.T.J., 43, July, 1964, pp. 1476-1479. 
2. Marcuse, D. and Miller, S. E., Analysis of a Tubular Gas Lens, B.S.T.J., 43, 
   July, 1964, pp. 1759-1782. 
3. Steier, W. H., Measurements on a Thermal Gradient Gas Lens, IEEE Trans., 
   MTT-13, November, 1965, pp. 740-748. 
4. Miller, S. E., unpublished work. 
5. Jacob, M., Heat Transfer, Vol. 1, John Wiley & Sons, New York, N. Y., 1949, 
   p. 17. 
6. Ibid., p. 433. 
7. Kennard, E. H., Kinetic Theory of Gases, McGraw-Hill Book Co., Inc., New 
   York, N. Y., 1938, p. 182. 
8. Degano, A., private communication. 


Acoustic Light Modulators Using 
Optical Heterodyne Mixing 


By R. W. DIXON and E. I. GORDON 
(Manuscript received September 21, 1966) 


Acoustic light modulators are discussed in which the optical carrier 
is reinserted in the diffracted, frequency-shifted light beam. Reinsertion is 
accomplished in a novel fashion using a Kösters prism. In contrast to the usual 
acoustic modulator, the diffracted light is power modulated at the acoustic 
frequency. Modulation bandwidth and depth are each calculated as a func- 
tion of the optical and acoustic beam parameters, assuming a Gaussian 
optical beam and rectangular acoustic beam. It is shown that the modulation 
bandwidth increases in proportion to the optical beam diffraction angle and 
equals the inverse of the transit time of the sound across the waist of the 
optical beam. The optimum modulation depth, for a given acoustic power, 
corresponds to approximate equality of the optical and acoustic diffraction 
angles. Confirming experiments in the frequency range 250-350 MHz are 
described. 


I. INTRODUCTION 


Detection of optical radiation using heterodyne mixing was pio- 
neered in the classic experiments of Forrester, Gudmundsen, and 
Johnson.* They successfully detected the microwave beat between two 
Zeeman components of a mercury arc. With coherent light sources the 
technique was utilized initially in the investigation of the mode struc- 
ture and frequency stability of the helium-neon laser.? Subsequently, 
optical heterodyne mixing has been used as a sensitive, high-resolution 
detector of frequency shifts in the study of Brillouin scattering*® and 
of the frequency broadening of Rayleigh scattered light.* 

It is well known that under the correct circumstances an optical 
beam, passing through a transparent material containing a traveling 
acoustic wave, has part of its energy diffracted by the refractive index 
variations associated with the acoustic wave. In the proper range of 
parameters, known as the Bragg region, the diffracted light is con- 
fined around a single direction. The system acts as a single-sideband 
suppressed-carrier modulator in which the diffracted light is intensity 
modulated with the envelope of the modulation subcarrier. Depending 
on the relative directions of the light and sound the diffracted light 
frequency is increased or decreased by the frequency of the modulation 
subcarrier. 

One may arrange to have the diffracted light fall into the same solid 
angle as a portion of the original light, and thereby allow detection 
of the diffracted beam by heterodyne mixing with the undiffracted 
beam. Heterodyne mixing of light in which the signal frequency is 
shifted by diffraction from sound in a liquid cell ultrasonic modulator 
has been demonstrated by Cummins and Knable,⁵ and the increased 
sensitivity of this technique has been briefly pointed out by Goodwin 
and Pedinoff.® 

Optical heterodyne procedures in combination with Bragg scattering 
can also be used as a sensitive detector of sound. For example, 
Lastovska and Benedek® have shown that thermal sound (thermal 
Brillouin scattering) may be detected in this way. In fact, in some 
applications—e.g., at very high frequencies—this technique can be a 
more sensitive detector of sound than the best available transducer. 
The advantages of optical techniques for investigating sound beam 
intensity profiles, angular distributions, etc. at any point in the me- 
dium in which the acoustic wave propagates have been pointed out 
earlier? and may now be supplemented by the increased sensitivity and 
spectral range which the utilization of optical heterodyne detection 
affords. An additional advantage of this method of optical detection is 
that solid-state photodetectors may be used instead of photomulti- 
pliers, without compromising sensitivity, whenever sufficient light is 
available that the shot noise associated with the optical local oscilla- 
tor limits detector sensitivity.® 

This paper is concerned with the detailed properties of an acoustic 
modulator when optical heterodyne detection of the modulated light 
is employed. A coherent optical source is assumed. The range of useful 
modulation frequencies for this technique extends at present well 
into the microwave region. Present limitations of efficient thin-film 
transducers limit operation to below 10 GHz. Acoustic loss for some 
applications becomes important at lower frequencies than this. The 
frequency response of commercial photodiodes extends to about 30 
GHz. 

The analysis includes a discussion of modulation bandwidth and 
optimum modulation conditions and concludes with the prediction that 
very large dynamic bandwidths may be obtained using optical hetero- 
dyne mixing in conjunction with an acoustic modulator. A series of 
experiments involving a novel beam splitter and various modulat- 
ing materials confirm this prediction. It is shown that bandwidth is 
related to the diffraction angle of the optical beam and may be varied 
over a large range by changing this angle. Experiments have been 
restricted to solid modulating materials, but the results are applicable 
without modification to liquids if their higher acoustic loss can be 
accepted. It is concluded that large modulation depths should be prac- 
tical at modulation frequencies well into the microwave region and at 
optical frequencies throughout the visible and infrared. It is also 
pointed out that for small diffracted light intensities, proportionately 
much larger modulation depths are possible using this technique than 
if the transmitted light beam alone were monitored. 

The relation of modulation depth and bandwidth for a given acoustic 
power is discussed, and experimental confirmation of the conclusions 
is presented. In addition, it is shown that with these acoustic light 
modulators frequency and phase information, as well as power modu- 
lation, may be transferred to the light with large dynamic bandwidth. 


II. DISCUSSION 


2.1 Optical Beam Geometry 

In order to obtain two beams for use in optical heterodyne mixing 
experiments, it has been common practice to use a beam splitter and 
mirror assembly similar to that shown schematically in Fig. 1. The con- 
figuration shown would be appropriate for coherent detection of light 
modulated by Bragg diffraction from an acoustic wave.® In the experi- 
ments described here it has been found very useful to replace the mirror 
and beam splitter of such an experiment with a Kösters prism.⁹ These 
prisms have been commonly used in Michelson interferometers and 
similar apparatus where ease of alignment is desired.¹⁰ Probably this 
prism has not received the attention it deserves for applications in 
modern physical experiments. 

Fig. 2 shows the general construction of a Kösters prism. Two 
accurately constructed 30-60-90° prisms are carefully cemented to- 
gether with a dielectric beam splitter between them. Because the two 
exit beams travel symmetrical paths to their intersection with any 
plane perpendicular to the beam splitter, they are optically identical. 

The experimental apparatus used in the present series of experiments 
is shown schematically in Fig. 3. The two beams from the Kösters 
Fig. 1 — Mirrors and beam splitters arranged for optical heterodyne detection 
of Bragg diffracted light (after Cummins and Knable, Ref. 5). 


prism are set to intersect within the acoustic beam at an angle equal 
to twice the Bragg angle so that Bragg diffracted light from each of 
the incident beams falls into the diffraction angle of the other beam. 
The Kösters prism assures symmetrical paths for the diffracted and 
undiffracted light and thereby makes alignment very easy. A lens of 
appropriate focal length is positioned so that the optical beam waists 
intersect with the desired convergence angle in the center of the modu- 
lator’s acoustic beam. 

Consider, in Fig. 3, an acoustic traveling wave originating from the 
transducer. The frequency of the light diffracted from beam 1 into 
beam 2 is increased by the acoustic frequency while the light diffracted 
Fig. 2 — A Kösters prism or double image beam splitter. 


[Fig. 3 block labels: He-Ne laser, 6328 Å; Kösters prism; lens; transducer; Microdot oscillator, 200-500 MHz; cutoff attenuator; local oscillator; hybrid; balanced mixer; IF preamp; IF or video output to CRO.] 


Fig. 3—Schematic diagram of experimental apparatus. 


from beam 2 into beam 1 is decreased by the same amount. The 
diffracted and transmitted beams interfere at the photodiodes after 
which the difference frequencies are amplified and detected. Since the 
difference frequency signal from diode 1 is 180 degrees out of phase 
with that from diode 2, the two signals may be fed, if desired, into a 
hybrid and will add at the port for which the output is the difference 
between the two inputs (the push-pull output). This has the advantage, 
which was not important in the present experiments but which might 
be for some applications, that amplitude noise on the incident optical 
beam (such as the beating of various laser modes) does not appear at 
the same output as the detected signal if the two sides of the detection 
system are balanced. In some of the experiments discussed only one 
diode was used; in others the balanced system worked very well and 
was no more difficult to align than the system employing a single diode. 
Modulation frequencies were normally near 300 MHz. 


2.2 Modulator Bandwidth 


In order to appreciate the limits on modulation bandwidth, it is 
instructive to consider qualitatively several special cases. For simplicity, 
assume that only photodiode 2 is used and that only an outgoing acoustic 
wave is present. The Késters prism is positioned so that the two light 
beams of frequency v which are incident on the acoustic modulator have 
their point of intersection at the center of the acoustic beam. They 
intersect at twice the Bragg angle, 26,, [sin 8, = 4f,A/v], which is a 
function of the desired modulator center frequency f,, the acoustic 
velocity v, and the optical wavelength in the medium X. The optical 
Fig. 4 — A k-vector diagram of the Bragg diffraction process in which optical 
and acoustic diffraction angles are assumed negligibly small. 


velocity is $c'$. The wave vector relation (momentum conservation) 
among the three waves may be written $\mathbf{k}_0 + \mathbf{K} = \mathbf{k}'$, where $|\mathbf{k}_0| = 2\pi\nu/c'$ 
and $|\mathbf{k}'| = 2\pi(\nu + f_0)/c'$ are the magnitudes of the incident and scattered optical 
k-vectors, and $\mathbf{K}$ is the k-vector of the ultrasonic wave. Since to order $|f_0/\nu| \ll 1$, 
$|\mathbf{k}_0| = |\mathbf{k}'|$, it is possible to make the elementary but useful construction 
shown in Fig. 4. The dotted circle has radius $|\mathbf{k}_0|$ and defines the locus 
of allowed $\mathbf{k}'$. Only a phonon of precisely the correct $|\mathbf{K}|$ will scatter 
$\mathbf{k}_0$ if one assumes that neither the optical nor acoustic beam has any 
angular width. 

In order to appreciate the effect of nonzero diffraction angle, consider 
the limiting case in which the diffraction angle of the acoustic beam is 
large compared with the diffraction angle of the optical beam (Fig. 5); 
$\mathbf{K}$ has a well-defined magnitude but an angular width $\Delta\theta$. Only those 
acoustic k-vectors near the direction of $\mathbf{K}_0$ scatter light into spatial 
coherence with the heterodyning beam $\mathbf{k}'$. Thus, the detected signal 
amplitude is lower than if the same acoustic power were confined to a 
smaller diffraction angle. 

If the acoustic frequency is increased to a new value, $|\mathbf{K}|$ is increased 
and the construction shown dashed is appropriate. No signal will be 
observed on the photodiode because no $\mathbf{K}$ can scatter into $\mathbf{k}'$. This 
modulator, therefore, not only produces low modulation intensity but 
also possesses small bandwidth. 

Now consider the other limiting case in which the diffraction angle 
of the optical beam is much larger than that of the acoustic beam (Fig. 
6). In this case, only that portion of the optical energy near the center 
of the diffraction angle can be scattered by the acoustic wave of vector 
$\mathbf{K}_0$ and heterodyned with the other optical wave. Therefore, the scat- 
tered intensity is much less than it would be for the same optical and 
acoustic powers, if the diffraction angle of the light were decreased. 

If the frequency is changed slightly from $f_0$, a new $\mathbf{K}$ is defined which is 
slightly different in magnitude but which has the same direction as 
$\mathbf{K}_0$. In this case, however, the detected optical signal at the photodiode 
is essentially unchanged as long as the deviation $\Delta\mathbf{K}$ is such that $\mathbf{K}_0 + \Delta\mathbf{K}$ 
still connects two points on the dashed circle which are within the 
diffraction angles of the incident and heterodyning optical beams. By 
varying $f$ one traces out the angular profile of the optical beam. A 
bandwidth may be approximately defined by the condition 


    $\Delta K = 2\pi\,\Delta f_B / v \approx \sqrt{2}\,\Delta\theta_0\,k_0 \cos\theta_0$    (1) 

from which it follows that the bandwidth $\Delta f_B$ is given by 

    $\Delta f_B \approx \sqrt{2}\,(v/\lambda)\,\Delta\theta_0 \cos\theta_0$    (2) 

Fig. 5— A k-vector diagram of the Bragg diffraction process in which the 
acoustic diffraction angle is large compared with the optical diffraction angle. 
in which $\lambda$ is the optical wavelength. The center frequency $f_0$ is defined 
by $\sin\theta_0 = \tfrac{1}{2} f_0 \lambda / v$. The optical beam diffraction angle $\Delta\theta_0$ may be ex- 
pressed in terms of the beam waist diameter $w_0$, corresponding to a 
Gaussian beam for which $w_0$ is the full width at half intensity, 


    $w_0\,\Delta\theta_0 \approx \left( \dfrac{2 \ln 2}{\pi} \right) \lambda$.    (3) 

Substituting for $\Delta\theta_0$ yields for the bandwidth 

    $\Delta f_B \approx \left( \dfrac{2\sqrt{2}\,\ln 2}{\pi} \right) \dfrac{v}{w_0} \cos\theta_0$    (4) 


which approximates the reciprocal of the transit time of the sound across 
the waist of the Gaussian light beam. This important result shows how 
acoustic modulators with large bandwidths are made possible by in- 
creasing the diffraction angle of the optical beam. The fact that the 
diffraction angle A@, of the optical beam must be less than the Bragg 
angle requires that the fractional bandwidth of the modulator obeys 
the inequality, using (2) and (8), 
Fig. 6 — A k-vector diagram of the Bragg diffraction process showing the origin 
of acoustic modulator bandwidth; the optical diffraction angle is large compared 
with the acoustic diffraction angle. 
Thus, fractional bandwidths as large as 70 percent are possible if trans- 
ducer bandwidths can be made compatible with this requirement. 
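
The 70-percent figure can be recovered from (2) together with the Bragg condition $f_0 = 2(v/\lambda)\sin\theta_0$. The following is a sketch of that step, not a transcription of the original inequality; it assumes a small Bragg angle, so that $\theta_0/\tan\theta_0 \approx 1$, and uses the stated requirement $\Delta\theta_0 < \theta_0$.

```latex
\frac{\Delta f_B}{f_0}
  \approx \frac{\sqrt{2}\,(v/\lambda)\,\Delta\theta_0 \cos\theta_0}
               {2\,(v/\lambda)\,\sin\theta_0}
  = \frac{\Delta\theta_0}{\sqrt{2}\,\tan\theta_0}
  < \frac{\theta_0}{\sqrt{2}\,\tan\theta_0}
  \approx \frac{1}{\sqrt{2}} \approx 0.71 .
```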

It should be clear now for the situation depicted in Fig. 6 that increas- 
ing the acoustic diffraction angle would increase the detected signal 
without affecting the bandwidth. Likewise, from Fig. 5, in the other 
extreme the beat signal increases but the bandwidth is unchanged when 
the acoustic diffraction angle is decreased. It is, therefore, plausible on 
the basis of the foregoing comments that the choice of acoustic diffraction 
angle approximately equal to optical diffraction angle is the optimum 
choice for a given bandwidth. The bandwidth is determined entirely 
by the diffraction angle of the optical beam. This conclusion is given 
quantitative expression in Appendix A. 


III. THEORY AND COMPARISON WITH EXPERIMENT 


In order to check the preceding ideas quantitatively, the photodiode 
current which would be expected with the geometry shown in Fig. 3 
was calculated for the case of light beams with identical Gaussian pro- 
files and an acoustic beam with rectangular cross section. It was known 
that these beam profiles could be accurately approximated experi- 
mentally. The lens was positioned in the incident optical beam so that 
the optical beam waist occurred at the center of the modulator. The 
diode photocurrent was computed as the far-field interference integral 
of the product of the acoustically diffracted optical amplitude and the 
heterodyning optical amplitude. Details of the calculation are presented 
in Appendix A. The expression for the component of photocurrent at 
the modulating frequency $f$, apart from material and numerical con- 
stants, is 


    $P_{\mathrm{opt}}\,(P_{\mathrm{acoustic}})^{1/2} \exp\left[ i 2\pi f (t - R/c') \right] \cdot \dfrac{\mathrm{Erf}(a)}{a^{1/2}} \cdot \dfrac{\mathrm{Erf}(b)}{b^{1/2}} \cdot \exp\left[ -\dfrac{\pi^2 w_0^2 (f - f_0)^2}{4 \ln 2\; v^2 \cos^2\theta_0} \right]$.    (6) 

Again $w_0$ is the full width at half intensity of the Gaussian beam, $2\theta_0$ 
is the intersection angle inside the modulating medium and $f_0$ is the 
acoustic frequency for which $\theta_0$ is the Bragg angle. The acoustic beam 
height is $h$ and its width (dimension in the plane formed by the acoustic 
and optic propagation directions) is $L$. The distance to the detector is 
$R$. Equation (6) is the basic relation which is subject to experimental 
verification. 

The first experimental checks of (6) confirmed that the acoustic and 
optical power dependences are correct. Experimentally, a 3-dB decrease 
in optical intensity or a 6-dB decrease in acoustic power, as expected, 
decreased the detected current by one-half. The other most interesting 
predictions made by (6) are contained in the terms $\mathrm{Erf}(b)/b^{1/2}$ and 

    $\exp\left[ -\dfrac{\pi^2 w_0^2 (f - f_0)^2}{4 \ln 2\; v^2 \cos^2\theta_0} \right]$. 

The former is concerned with the maximum detected signal amplitude 
and the latter with the dynamic modulation bandwidth, both as func- 
tions of the diffraction angle of the incident light. 





3.1 Bandwidth vs Beam Waist Diameter 


Consider first the term 


    $\exp\left[ -\dfrac{\pi^2 w_0^2 (f - f_0)^2}{4 \ln 2\; v^2 \cos^2\theta_0} \right]$. 


When the modulating frequency $f$ changes from the value $f_0$, this 
term describes the decrease in detected signal. An acoustic half-power 
bandwidth, $\Delta f_B$, within which the detected current is no less than $1/\sqrt{2}$ 
of its maximum value, may be defined and is given by 

    $w_0\,\Delta f_B = \left( \dfrac{2\sqrt{2}\,\ln 2}{\pi} \right) v \cos\theta_0$.    (7) 
The beam waist diameter times the bandwidth is thus a constant for 
a given material at a given center frequency $f_0(\theta_0)$. By making the beam 
waist smaller, e.g., by focusing the incident optical beam, the dynamic 
bandwidth may be increased. As indicated earlier the bandwidth is 
intimately related to the transit time of the acoustic wave across the 
optical beam. In fused quartz for longitudinal waves at frequencies low 
enough that $\cos\theta_0 \approx 1$, (7) becomes 


    $w_0\,\Delta f_B = 3.70 \times 10^{5}$ cm/sec.    (8) 
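
The waist-bandwidth product can be evaluated directly. The sketch below implements the half-power bandwidth as Δf_B = (2√2 ln 2/π) v cos θ₀ / w₀, which is the form of (7) adopted here from the Gaussian term in (6). The longitudinal sound velocity used for fused quartz, 5.96 × 10⁵ cm/sec, is an assumed handbook-style value and is not given in the text; the function names are illustrative.

```python
import math

# Sketch: half-power bandwidth of the modulator versus beam waist.
# The sound velocity below is an ASSUMED value for longitudinal waves
# in fused quartz; the source text does not state it.
V_FUSED_QUARTZ_CM_PER_S = 5.96e5

# Numerical factor 2*sqrt(2)*ln(2)/pi appearing in the waist-bandwidth product
BANDWIDTH_FACTOR = 2.0 * math.sqrt(2.0) * math.log(2.0) / math.pi


def waist_bandwidth_product(v_cm_per_s, theta0_rad=0.0):
    """Return w0 * delta_f_B in cm/sec for sound velocity v and Bragg angle theta0."""
    return BANDWIDTH_FACTOR * v_cm_per_s * math.cos(theta0_rad)


def half_power_bandwidth_hz(w0_cm, v_cm_per_s, theta0_rad=0.0):
    """Return delta_f_B in Hz for a beam waist w0 (full width at half intensity)."""
    return waist_bandwidth_product(v_cm_per_s, theta0_rad) / w0_cm


if __name__ == "__main__":
    product = waist_bandwidth_product(V_FUSED_QUARTZ_CM_PER_S)
    print(f"w0 * delta_f_B = {product:.3e} cm/sec")
    print(f"delta_f_B for w0 = 0.1 mm: "
          f"{half_power_bandwidth_hz(0.01, V_FUSED_QUARTZ_CM_PER_S) / 1e6:.1f} MHz")
```

The factor of about 0.62 multiplying v/w₀ is what makes the bandwidth approximate the reciprocal of the acoustic transit time across the waist.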


In Fig. 7 the measured bandwidth $\Delta f_B$, for modulation in fused quartz, 
is plotted against $w_0^{-1}$ and (8) is shown plotted as the solid line. The 
Fig. 7 — Experimental dynamic bandwidth plotted against the reciprocal of the 
optical beam waist diameter. The solid line is the theoretical expression, (8). 


value of $w_0$ was varied by placing lenses of different focal lengths $F$ 
in the incident optical beam. The relation between $w_0$ and $F$ was assumed 
to be of the form given by Kogelnik except that $w_0$ is defined as the 
full width at half intensity of the Gaussian beam. In the limit appropriate 
for the present experiments 

    $w_0 = \left( \dfrac{2 \ln 2}{\pi} \right) \dfrac{F \lambda_0}{W_0}$,    (9) 

in which $W_0$ is the beam waist of the incident laser beam and $w_0$ is the 
beam waist of the beam in the scattering medium following its trans- 
formation by a lens of focal length $F$. Here, $\lambda_0$ is the free space wave- 
length. Equation (9) was used to convert from values of $F$ to values of 
$w_0$. Using a scanning slit and photomultiplier the value $W_0$ = (1.00 ± 
0.01) mm was found. These measurements incidentally verified that the 
intensity profile of the laser beam was approximately Gaussian. 
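
The conversion from focal length to beam waist can be sketched as follows, taking (9) in the form w₀ = (2 ln 2/π) F λ₀ / W₀ with both waists defined as full widths at half intensity. The wavelength of 6328 Å and W₀ = 1.00 mm come from the text; the focal lengths in the loop are hypothetical examples, not the lenses actually used.

```python
import math

# Sketch: focused beam waist from lens focal length, using the form of
# Eq. (9) adopted here. Focal lengths below are HYPOTHETICAL examples.
WAVELENGTH_MM = 632.8e-6   # He-Ne laser, 6328 angstroms, in mm
INPUT_WAIST_MM = 1.00      # measured W0, full width at half intensity


def focused_waist_mm(focal_length_mm,
                     wavelength_mm=WAVELENGTH_MM,
                     input_waist_mm=INPUT_WAIST_MM):
    """Return w0 in mm for a lens of the given focal length."""
    return (2.0 * math.log(2.0) / math.pi) * focal_length_mm * wavelength_mm / input_waist_mm


if __name__ == "__main__":
    for focal_length in (50.0, 100.0, 200.0):
        print(f"F = {focal_length:5.0f} mm -> w0 = {focused_waist_mm(focal_length):.4f} mm")
```

Because w₀ is proportional to F, halving the focal length halves the waist and, by (7), doubles the dynamic bandwidth.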

The experimental points in Fig. 7 were obtained using apparatus 
which is shown schematically in Fig. 3. The Microdot oscillator produced 
repetitive pulses of RF energy with each pulse having about 1-μsec 
duration. Cadmium sulfide thin-film transducers converted these elec- 
tromagnetic pulses into acoustic energy. The deflected light pulses 
were detected in the indicated photodiode geometry followed by a stand- 
ard microwave superheterodyne receiver. In order to avoid IF detector 
diode nonlinearity the IF output of the system was usually viewed di- 
rectly on a broadband oscilloscope. Data were taken from the amplitude 
of the first deflected light pulse (due to the initial outgoing acoustic 
wave) and were, therefore, independent of acoustic resonance effects 
which would be associated with long acoustic pulses. The RF power 
incident on the transducer was kept the same at each frequency. A 
small amount of each RF pulse was fed, using a beyond cutoff attenuator, 
from in front of the transducer to the input of the receiver. This pulse 
served as a calibrating signal for the receiver and helped to correct for 
changes in receiver sensitivity as the frequency was changed. The 
transducer response in the 100-MHz region around 300 MHz was flat 
to within 1 dB. The scatter of the experimental points can be attributed 
to several causes, the most important of which is the unavoidable small 
change in optical alignment between the times at which each bandwidth 
curve was taken. It is believed that the largest observed value for any 
given value of $w_0$ is the most appropriate and the agreement is considered 
to be good. 


3.2 Signal Amplitude vs Beam Waist Diameter 


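Now consider the amplitude terms in (6) which involve the error 
functions. When $f = f_0$ the signal amplitude is proportional to 


    $\dfrac{\mathrm{Erf}(a)}{a^{1/2}} \cdot \dfrac{\mathrm{Erf}(b)}{b^{1/2}}$    (10) 

where 

    $a = (\ln 2)^{1/2}\,\dfrac{h}{w_0}$, 

    $b = (\ln 2)^{1/2}\,\dfrac{L \sin\theta_0}{w_0}$. 

In the experiments, the acoustic beam height $h$ was made sufficiently 
large (3 mm) compared with the largest beam waist (1 mm) that 
$\mathrm{Erf}(a) \approx 1$ for all values of $w_0$ of interest. Thus, the signal current as 
a function of $w_0$ is given by 


    $A(w_0) = A_0\,w_0\,\mathrm{Erf}\left[ (\ln 2)^{1/2}\,\dfrac{L \sin\theta_0}{w_0} \right]$.    (11) 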


This equation is plotted as the solid curve in Fig. 8; $A_0$ has been con- 
sidered an adjustable normalizing parameter but the argument of the 
error function is determined using the experimental acoustic beam width 
of $L$ = 7.00 mm. The experimental points were taken using the con- 
figuration shown in Fig. 3 and the beam waist diameter was obtained 
using (9). Again the agreement is quite good. 
Fig. 8 — Experimental relative modulation frequency photocurrent vs beam 
waist diameter. The solid curve is the normalized theoretical expression, (11). 


3.3 Signal Amplitude vs Transducer Width 


The main interest thus far has been in determining the result on the 
signal amplitude of varying the optical diffraction angle and demonstrat- 
ing that it is solely responsible for the bandwidth. It is instructive to 
re-emphasize the effect of the acoustic diffraction angle by calculating 
the detected signal variation as a function of acoustic beam width L. 
For a given w_0, h, θ_0, and acoustic power the detected signal amplitude
at f = f_0 is given by

$$A(L) = A_0\,\frac{\operatorname{Erf}(b)}{b^{1/2}}, \qquad b = (\ln 2)^{1/2}\,\frac{L\sin\theta_0}{w_0}. \tag{12}$$


It should be noted that b can also be written as 0.85 × (diffraction angle
of light/diffraction angle of the sound).
In the region of small L (b ≪ 1),

$$A(L) = A_0\,(2/\pi^{1/2})\,b^{1/2} \propto L^{1/2}. \tag{13}$$

Therefore, for a given acoustic power, the signal may be increased by
increasing the transducer width. The diffraction angle of the sound is
too large for optimum scattering from the given optical beam. In the
opposite limit of large L (b ≫ 1),

$$A(L) = A_0/b^{1/2} \propto L^{-1/2}. \tag{14}$$


Increasing L decreases the signal. Here, the acoustic diffraction angle 
is too small to use all of the incident light. 

The signal amplitude has a broad maximum at b = 0.99 corresponding 
to approximate equality of the acoustic and optical diffraction angles for 
optimum modulator efficiency. This conclusion is expected to depend 
only weakly on the actual beam geometries. 
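
The position of this maximum can be checked numerically. The sketch below assumes the dependence A(L)/A_0 = Erf(b)/b^{1/2}, which is the form consistent with the two limits (13) and (14); the function names are illustrative and not part of the original analysis.

```python
import math


def relative_amplitude(b: float) -> float:
    """Relative signal A(L)/A_0 = Erf(b) / sqrt(b) at fixed acoustic power (assumed form)."""
    return math.erf(b) / math.sqrt(b)


def find_peak(lo: float = 0.1, hi: float = 3.0, tol: float = 1e-6) -> float:
    """Golden-section search for the b that maximizes relative_amplitude on [lo, hi]."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, c = lo, hi
    while c - a > tol:
        x1 = c - inv_phi * (c - a)
        x2 = a + inv_phi * (c - a)
        if relative_amplitude(x1) > relative_amplitude(x2):
            c = x2  # the maximum lies in [a, x2]
        else:
            a = x1  # the maximum lies in [x1, c]
    return 0.5 * (a + c)


if __name__ == "__main__":
    b_peak = find_peak()
    print(f"peak at b = {b_peak:.3f}, A/A_0 = {relative_amplitude(b_peak):.3f}")
    for b in (0.01, 0.1, 10.0):
        print(f"b = {b:5.2f}: A/A_0 = {relative_amplitude(b):.4f}")
```

The search brackets a single broad maximum; away from it the function follows the square-root rise of (13) and the inverse-square-root fall of (14).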


IV. MODULATION DEPTH 


Consider now the modulation depth which one would expect from an 
acoustic modulator. A simple computation of modulation depth is 
possible only when the waves are considered in a plane wave approxi- 
mation in the Bragg region. This situation should approximate qualita- 
tively the behavior expected with the present experimental configura- 
tion. In the Bragg region, there exists a well-defined angular relationship 
among the beam directions. Furthermore, all of the light diffracted 
from the main beam is diffracted into a single Bragg order. Under these 
circumstances, the solution of the problem of the generation of an optical 
beam by parametric coupling with an acoustic beam shows that the 
amplitude of the diffracted beam can be written™ 


$$P_1 = \pm i\,P_{\mathrm{inc}}^{1/2}\,\sin\eta^{1/2}\,\exp i2\pi(\nu \pm f)t, \tag{15}$$

while the transmitted beam has the form

$$P_0 = P_{\mathrm{inc}}^{1/2}\,\cos\eta^{1/2}\,\exp i2\pi\nu t, \tag{16}$$

in which η is a scattering parameter defined for a rectangular acoustic
beam by

$$\eta = \frac{\pi^2}{2\lambda^2}\,\frac{n^6 p^2}{\rho v^3}\,\frac{P_a L}{h\cos^2\theta_0}, \tag{17}$$

where L is the beam width, P_a the acoustic power, h the acoustic beam
height, p the appropriate photoelastic component, v the acoustic velocity,
ρ the mass density, and n the refractive index.

For conventional acoustic modulation, a square law photodetector
placed in the transmitted beam will produce a photocurrent proportional
to |P_0|², which has a maximum value of P_inc for no acoustic signal and
a minimum value P_inc cos² η^{1/2} when the acoustic signal is present. Hence,
a modulation depth

$$m_1 = \sin^2\eta^{1/2} \tag{18}$$

may be defined which is some measure of the ability to detect the influence
of the acoustic energy on the light.
Similarly for the superheterodyne case, the photocurrent has the form

$$|P_0 + P_1|^2 = P_{\mathrm{inc}}\,(1 \pm \sin 2\eta^{1/2}\,\sin 2\pi f t)$$

and

$$m_2 = \frac{2\sin 2\eta^{1/2}}{1 + \sin 2\eta^{1/2}}.$$
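
The two modulation depths can be compared numerically. The sketch below takes the ordinary depth as m_1 = sin² η^{1/2} and the heterodyne depth as m_2 = 2 sin 2η^{1/2} / (1 + sin 2η^{1/2}), in the plane-wave Bragg approximation used here; the function names are illustrative only.

```python
import math


def ordinary_depth(eta: float) -> float:
    """m_1 = sin^2(sqrt(eta)): depth when the transmitted carrier is intensity modulated."""
    return math.sin(math.sqrt(eta)) ** 2


def heterodyne_depth(eta: float) -> float:
    """m_2 = 2 sin(2 sqrt(eta)) / (1 + sin(2 sqrt(eta))): depth with the carrier reinserted."""
    s = math.sin(2.0 * math.sqrt(eta))
    return 2.0 * s / (1.0 + s)


if __name__ == "__main__":
    # eta is the scattering parameter, close to the deflected fraction when eta < 0.1
    for eta in (1e-4, 1e-3, 1e-2, 1e-1):
        print(f"eta = {eta:7.4f}  m_1 = {ordinary_depth(eta):.4f}  m_2 = {heterodyne_depth(eta):.4f}")
```

For small η the ordinary depth grows as η while the heterodyne depth grows as 4η^{1/2}, which is why the advantage is largest at small deflected intensities.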


Clearly, this modulation depth is superior to that obtained when the
carrier is simply intensity modulated. This superiority is most dramatic
when small deflected intensities are involved. Fig. 9 shows a comparison
of the two modulation depths, each plotted as a function of the scattering
parameter η (which for η < 0.1 is quite accurately equal to the deflected
intensity). It is seen that when 10 percent deflected intensity is
obtained, the modulation depth with the optical heterodyne system is
75 percent. For the very small deflected intensity of 0.01 percent, one
still has a usable modulation depth of 4 percent in the optical heterodyne
detector.

Fig. 9— Theoretical curves, valid for plane waves, of the ordinary optical
modulation depth m_1 and the optical heterodyne modulation depth m_2, both
plotted vs acoustic scattering parameter η.

Modulation depth was investigated experimentally at a modulation
frequency of 20 MHz, chosen instead of a frequency near
300 MHz (as in the other experiments described in this paper) in order
to display the modulation depth directly. At 20 MHz one photodiode
could be directly connected to an oscilloscope with amplifiers having
sufficient baseband bandwidth and gain to display both the dc diode
photocurrent and the 20-MHz modulating photocurrent. Fig. 10 shows
a typical oscilloscope trace of the diode output for longitudinal acoustic 
waves propagating in KRS-5 when the relative diffracted light intensity 
was about 20 percent. The baseline corresponds to no light. Fig. 11 
shows a comparison of the modulation depth measured from such photo- 
graphs compared with the total intensity diffracted from the incident 
beam under the same experimental conditions. KRS-5 was again the 
modulating medium. The very large modulation depths obtained for 
small diffracted intensities are, of course, the most interesting feature 
of these curves and qualitatively verify the ideas just discussed. 

There is some disadvantage in working at a frequency as low as 20 
MHz, viz., that the optical-acoustic interaction is not strictly in the 
Bragg region and a significant amount of light from the main optical 
beam is diffracted into orders other than that satisfying the Bragg 
condition. For this reason, the curve of m, against acoustic power does 
not reach 100 percent as it would if the modulating frequency were 
high enough (greater than about 60 MHz) that Bragg diffraction was 
dominant. At these increased frequencies a direct display of the modulation
depth, such as that shown in Fig. 10, becomes difficult for the small
photocurrents used experimentally.

Fig. 10 — Oscilloscope trace showing 80 percent modulation depth at 20 MHz
obtained using optical heterodyne detection. The relative diffracted intensity was
about 20 percent.

Fig. 11 — Experimental comparison of ordinary modulation depth m_1 and
heterodyne modulation depth m_2 for longitudinal acoustic waves in KRS-5.


V. MEASUREMENT OF ACOUSTIC DIFFRACTION PATTERN 


It is known that Bragg diffraction can be used to determine the angular 
distribution or diffraction pattern of the acoustic beam.’ Using a light 
beam sufficiently collimated that the diffraction angle of the light is 
much less than that of the sound, measurement of the scattered light 
power as a function of the angle of incidence of the light relative to 
the Bragg angle yields directly the angular distribution of acoustic 
energy. The angle of incidence is changed by slowly rocking the acoustic 
medium. 

A similar experiment can be performed using the arrangement of 
Fig. 3. In this case, the component of photo-current at the acoustic 
frequency measures the amplitude of the acoustic angular distribution. 

Typical results are shown in Fig. 12. Photograph (a) illustrates the 
case of a well-collimated light beam with diffraction angle much smaller 
than that of the acoustic beam, (b) and (c) are for progressively larger 
optical diffraction angles. In (c) the optical diffraction angle is large 
enough that the curve illustrates the Gaussian character of the light 
beam. The deviations from the expected sin X/X behavior relate to 
the lack of antireflection coatings on the acoustic medium.’ 

Homodyne detection of the photocurrent using the acoustic input
signal as a reference would allow determination of the phase of the
acoustic angular distribution as well as the amplitude. Thus, this
technique has some significant advantages over the original experiments
described in Ref. 7. The increased sensitivity of this technique
should also be noted.

Fig. 12— Oscilloscope display of the modulation frequency photocurrent vs
angle of rotation of the modulating medium (cf. Fig. 3). Traces (a), (b), and
(c) show results for increasing values of the optical beam diffraction angle.


VI. CONCLUSION 


A novel type of acoustic modulator arrangement has been described 
which allows superheterodyne detection of the diffracted light. 

Distinct from electro-optic modulators, the optical modulation side- 
bands of acoustic devices are well separated in angle from the optical 
carrier and intensity modulation at the subcarrier modulation frequency 
is not possible. It has been shown that the optical carrier may be rein- 
serted in the appropriate direction in a simple and convenient fashion 
allowing intensity modulation at the modulation frequency. 

Optimum modulator configurations, corresponding to approximate 
equality of the optical and acoustic diffraction angles have been derived 
and the modulation bandwidth has been shown to be proportional to 
this angle or alternately to be equal to the inverse of the acoustic transit 
time across the optical beam waist. Confirming experiments in the 
frequency range 250-350 MHz have been described.


APPENDIX A


Calculation of Photodetector Output 


The photodetector is usually placed in the focal plane of a collecting 
lens. For the purpose of calculation, one may assume that the surface 
of the photodetector is hemispherical, centered on the interaction 
volume, and sufficiently large that detection occurs in the optical far-
field. The photocurrent is proportional to the instantaneous integrated
intensity or power falling on the surface. The component of the power 
or photo-current at the acoustic frequency is proportional to the inter- 
ference integral of the transmitted and diffracted optical beams. 

The calculation is performed by determining the angular dependence
of the transmitted and diffracted beams and integrating the product
of the two functions on the surface of a sphere of radius R. The observation
point (X, Y, Z) in Fig. 13 has coordinates

$$\begin{aligned}
X &= R\sin\theta \\
Y &= R\cos\theta\cos\varphi \\
Z &= R\cos\theta\sin\varphi.
\end{aligned}$$

Fig. 13 — Coordinate system used in the calculation of the heterodyne
photocurrent.


The near-field amplitude distribution of the transmitted Gaussian beam
which is incident in the x_0-y_0 plane at angle θ_0 is given by

$$\psi(x_0, y_0, z_0, \theta_0) = \psi_0 \exp -\left\{\frac{2\ln 2}{w_0^2}\left[(x_0\cos\theta_0 - y_0\sin\theta_0)^2 + z_0^2\right] + ik\,(y_0\cos\theta_0 + x_0\sin\theta_0)\right\} \tag{20}$$

in which w_0 is the half-power beam diameter and k is the propagation
constant. The amplitude on the sphere is given by
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W(X, Y,Z) = = | az, | dix, | exp = br (-24) 
—o —o JO (21) 
_O exp — a | 
+ W(x. ’ Yo eee OY, r xs 
ra(X Sa) 4(Y =9) eH 2) / 


corresponding to the usual Green’s function solution for an outgoing 
wave from a source in the x, , z, plane. Making the far-field approxima- 
tions x,/R, z./R << 1 and kR >> 1, where applicable, and performing 
the integration yields for the far-field amplitude 





_, hepa ik, ( ws 
vy. = Vv, 3P [cos 6, + cos @ cos ¢] i cose: 
kw? 
X exp — a7 9 [(cos @ sin vg)” + (sin 6, — sin 6)’/cos’ 6,] (22) 


which can be recognized as the angular dependence of a Gaussian beam 
including the obliquity factor. The time-dependence is exp 727vt. 

The diffracted beam of frequency v + f arises from a perturbation in 
the optical polarizability of the medium interacting with an optical beam 
moving at angle −θ_0. The perturbation is proportional to the strain
amplitude of the acoustic beam through the appropriate components of 
the photoelastic tensor. The volume polarization, at frequency v + f, 
in the limit of scattering sufficiently weak that the local field amplitude 
of the incident beam is essentially unchanged by the presence of the 
acoustic wave, may be written 


$$p(x_0, y_0, z_0)\exp i2\pi(\nu + f)t = \psi(x_0, y_0, z_0, -\theta_0)\,\exp(i2\pi\nu t)\,S_0 \exp i(2\pi f t - Kx_0),$$

$$-\tfrac{1}{2}L \le y_0 \le \tfrac{1}{2}L, \qquad -\tfrac{1}{2}h \le z_0 \le \tfrac{1}{2}h, \tag{23}$$

and zero otherwise. A rectangular acoustic beam of width L and height
h, moving along the x_0-axis, has been assumed. The constant S_0 represents
the perturbed susceptibility which is proportional to the strain
amplitude. The function ψ(x_0, y_0, z_0, −θ_0) is given by (20) with the
appropriate change in the sign of θ_0. The diffracted beam amplitude at
frequency ν + f and propagation constant k' = 2π(ν + f)/c' may be
written


$00 +RL 44h ey, 
bX, ¥,2)= | def dye | devoles , ys» 2) “P= ** (aay 


corresponding to the volume Green’s function solution for an outgoing
wave.

The integral in (24) may be evaluated in a straightforward but
lengthy fashion to yield


— iW (aL inane 
Vir = PSo = (2) ReErf 20'() 7 ie eos Osim | 


sin (€ — 7 tan 6,)L/2 hk’? 2 
year ris a exP ~ “48 | cos Osim’ g + oo | , (25) 


in which 


$$\beta = \frac{2\ln 2}{w_0^2},$$

$$\eta = k'\sin\theta + k\sin\theta_0 - K,$$

$$\xi = k'\cos\theta\cos\varphi - k\cos\theta_0,$$


and ReErf(z) is the real part of the error function of complex argument. 
It may be shown that for the parameters of interest one can make 
the approximation 


tk’ cos @sin ¢ 


3 )| ew Erf [46h] 


ReErf E. (r 


with negligible error. 
Except for constant factors the photocurrent may be written 


27 w/2 
f) = R | a [ |, 49-008 OY (26) 


The integral in (26) may be evaluated using (25) and (22). After ap- 
propriate manipulations the expression for the photocurrent takes the 
form 








_ 1 r° 4 : , 
(f) — a: In 9 (2 6, Piya 2 wcastio) exp 12rf (i = R/c’) 


. | Bt 1G tn 2 b/are] | Bt Kim 28/0.) sin (27) 
(Gln 2)'h/w.}F IL [dn 2(L/w,) sin 6,7 


wif — f.)"ws 


eS a 4 In 2v” cos” 6, ’ 


in which

$$f_0 = (2v/\lambda)\sin\theta_0$$

is the optimum frequency for Bragg diffraction and





2 
P seieni = TWo 


$$P_{\mathrm{acoustic}} = Lh\,|S_0|^2.$$


Water Vapor Permeability of
Polyethylene and Other 
Plastic Materials 


By R. L. HAMILTON 
(Manuscript received October 6, 1966) 


Low-density polyethylene sheathing materials have water vapor permeabilities
on the order of 10⁻⁹ at 22°C. High-density polyethylenes have
permeabilities about one-third to one-sixth that of the low-density
polyethylene. Copolymers of polyethylene have higher permeabilities than the
homopolymer. As an example, 15 percent ethyl acrylate comonomer increased
the permeability by a factor of 10 over that of straight low-density
polyethylene. The nonolefinic polymers tested have higher permeabilities.
For example, polyurethane plug compound has a permeability more than
80 times higher than low-density polyethylene. Finally, it was found that
the addition of carbon black decreases the water vapor permeability roughly
in proportion to the amount of carbon black, and that the permeability
of these materials increases with increasing porosity.

To make these measurements, two types of laboratory apparatus have 
been constructed. The first of these makes the permeability measurement 
on a tubular sample of the material, and the other on films. Both methods 
used an electrolytic moisture monitor, which is commercially available, to
make necessary determinations of water transfer rate through the plastic. 


I. INTRODUCTION 


The post war years have seen a phenomenal proliferation of plastic 
materials throughout industry, and the Bell System has been no exception.
At least part of the reason for this widespread and ever-increasing
use is the attractive ease of fabrication of plastic materials
and their relatively low cost. Their inertness to certain environmental
factors and their chemical and physical stability also add to their
value in a great number of applications, including environmental protection.
Not only are these plastics used to enclose and isolate an entire
apparatus or structure from its environment, they are also used
as protective coatings and as seals such as O-rings in metal containers.
One of the undesirable and detrimental factors in the environment is 
water, and consequently, a need developed for information on the 
water resistant characteristics of the many plastics in use. This water 
resistant characteristic is the water vapor permeability (WVP) of the 
material, and has been measured for several materials of interest. This 
paper discusses the need for the measurements, the method of making 
and correlating measurements, and the significance of the results. 

Several problems have been documented involving moisture transfer 
rates in complicated, composite plastic materials. Water in small 
amounts, particularly in pulp insulated cable, has detrimental effects 
on the electrical characteristics of the core. Splicing an Alpeth cable 
into a paper insulated cable results in moisture accumulation in the 
latter cable, presumably as a result of moisture diffusion through the 
Alpeth sheath and subsequent migration into the paper core. Air fed 
into pulp cable through polyethylene air tubing also results in mois- 
ture accumulation in the pulp because of water transmission through 
the tubing walls. 

Most of the design and investigational calculations made in regard 
to these problems were based on sparse data for more or less idealized 
systems. Unfortunately, for the materials of interest, there are no 
moisture transfer rate data available to the engineer. The studies de- 
scribed in this report were motivated by this paucity of data and it 
was planned to acquire such data for practically pertinent systems 
for the design engineer. Although the work is directed toward prob- 
lems arising in the cable plant, the data and results of this study may 
be of use in other areas employing plastic materials. 

In order to correlate and understand the effects of different parame- 
ters on permeation, it was necessary to include in the investigation 
measurements on systems far removed from practically applicable 
systems. For example, an investigation of the effect of carbon black 
loading on permeability must include measurements on natural (un- 
loaded) polymers as well as actual sheathing materials which con- 
tain carbon black. 

The reader will not be subjected to a long and detailed review and 
analysis of the previous literature in this field. There are two reasons 
for this; first, recent reviews’? have been given of the literature on 
permeation processes in plastics. Secondly, the previous literature is 
principally concerned with purified materials and is not highly perti- 
nent to the problems and information dealt with in this paper. Previous 
papers?? have indicated that the permeabilities of polyethylenes to
water vapor are quite low, on the order of 2 × 10⁻⁹ scc*/sec-cm-cm
Hg. Also, the previous work has shown that permeability decreases 
with increasing polymer density.? In addition, it has been found that 
the permeability is an exponential function of reciprocal absolute 
temperature** and that the activation energy for permeation ranges 
from some 6 kilocalories to almost 10 kilocalories.*+ 
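
The size of this temperature effect can be illustrated with a short calculation. The sketch below assumes the Arrhenius form P = P_0 exp(−E/RT) and takes the quoted activation energies to be per mole; the function name and the example temperatures are illustrative choices, not values from the measurements.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mol K)


def permeability_ratio(e_act_kcal_per_mol: float, t1_celsius: float, t2_celsius: float) -> float:
    """Ratio P(T2)/P(T1) for P = P_0 exp(-E/RT), with temperatures given in deg C."""
    t1 = t1_celsius + 273.15
    t2 = t2_celsius + 273.15
    return math.exp((e_act_kcal_per_mol / R_KCAL) * (1.0 / t1 - 1.0 / t2))


if __name__ == "__main__":
    # Compare the ends and the middle of the quoted activation-energy range.
    for e_act in (6.0, 8.0, 10.0):
        ratio = permeability_ratio(e_act, 22.0, 40.0)
        print(f"E = {e_act:4.1f} kcal/mol: P(40 C)/P(22 C) = {ratio:.2f}")
```

Because only the ratio is computed, the pre-exponential factor P_0 drops out and no absolute permeability needs to be assumed.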

Water vapor permeabilities of other materials have also been given 
in the literature. It is generally found that the permeabilities of ma- 
terials such as nylon, cellophane, and other nonolefinic polymers are 
much higher than permeabilities of polyethylene.*?)+ 

There is some disagreement?* among previously published values of 
permeability of polyethylene, and this has been explained® in terms 
of differences in the samples of materials used, but all the previous 
work agrees on one point: The transport of water vapor through 
polyethylene obeys Fick’s and Henry’s laws. Fick’s law relates mass 
transfer rate M to a concentration gradient and in finite difference 
form is 

$$M = DA\,\frac{\Delta C}{\Delta X}, \tag{1}$$

where D is the diffusion coefficient, A is an area, and ΔC is a concentration
difference across an increment in length, ΔX. Henry’s law is

$$C = Sp, \tag{2}$$

where S is solubility and p is the vapor pressure of penetrant. If
Henry’s law and Fick’s law are combined,

$$M = DSA\,\frac{\Delta p}{\Delta X}. \tag{3}$$

Usually,

$$DS = P, \tag{4}$$

the permeability.

These laws are used to correlate moisture diffusion rates by calculating
P from the definition and data on transfer rates. These data
must include, of course, M, Δp, and ΔX. The principal experimental problem
is measuring these factors on practically pertinent systems so
that P can be calculated.

Essentially, the plastic sample separates two chambers, one of which
contains water at a known temperature. As this water permeates the
sample, it is swept from the other chamber (with a carrier gas) and
into a water measuring instrument. From the geometry of the sample,
A and ΔX in (3) can be determined and from the water temperature,
Δp can be obtained. The problem is in measuring M, the mass flow
through the sample. The water measuring instrument must be capable
of measuring very small amounts of water: for example, if a plastic
tube is 10 cm long, 2.5 cm in diameter with a wall thickness of 2 mm,
there will be only 5 μg of water per hour permeating through the wall.
Previous workers have indicated the rather large amounts of water
that can accumulate in some of the older cable designs but this is in
terms of miles of cable and years of time. Obviously, for experimental
facility we must use shorter lengths of cable and must be able to work
on a much shorter time scale and this, in turn, forces one to work
with very minute quantities of water. So there is no inconsistency in
the larger amounts of water in the cable plant problems and minute
amounts encountered in the laboratory experiments.

* The term scc refers to cubic centimeter of vapor at standard temperature
and pressure.


II. DESCRIPTION OF EXPERIMENTAL APPARATUS 


Moll? has described some of the more widely used techniques of 
measurement of permeability. Most of these methods were not directly 
applicable to water permeation in the materials of practical engineer- 
ing interest. One such earlier method uses a thermistor (to detect 
water) which can become fouled from plasticizers and other volatile 
additives in sheathing material. The “cup method” is usually used on 
materials with higher permeabilities. Moreover, both methods are more
suited to very thin samples—less than 5 mils and sometimes as low as 
1 mil.* At these thicknesses, surface imperfections (holes, pits, etc.) 
can account for a large part of the water transferred across the film. 
For these reasons, it was felt desirable to construct a new apparatus 
based on an electrolytic water measuring technique which has proved 
reliable.®

An instrument capable of making the necessary water measure- 
ments is available commercially (Consolidated Electrodynamics Cor- 
poration). This “moisture monitor” operates as follows: A glass cell 
is coated with phosphorus pentoxide which is a tenacious absorbent 
for water. The coat of phosphorus pentoxide is interspersed with 
platinum electrodes and a carrier gas sweeps the moisture from the 
test sample and into the cell. As the water is absorbed by the phosphorus
pentoxide it is electrolyzed to hydrogen and oxygen and the
current necessary for electrolysis is directly related to the amount of
water so that the detector is calibrated essentially by the definition of
electrical current. The precision of measurement depends on the precision
of a microammeter. The detector also has the advantage of being
specific for water: other materials (such as antioxidants and light 
oils) which diffuse through the plastic or out of it do not interfere 
with the water measurement. 
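
The relation between electrolysis current and water rate follows from Faraday's law. The sketch below assumes complete electrolysis with two electrons transferred per water molecule; the function names are illustrative and do not describe the commercial instrument's internals.

```python
FARADAY_C_PER_MOLE = 96485.0   # charge per mole of electrons
WATER_G_PER_MOLE = 18.015      # molar mass of water
ELECTRONS_PER_MOLECULE = 2     # H2O -> H2 + 1/2 O2


def current_microamps(water_ug_per_hr: float) -> float:
    """Steady electrolysis current (microamperes) for a given water arrival rate."""
    moles_per_sec = water_ug_per_hr * 1e-6 / 3600.0 / WATER_G_PER_MOLE
    return moles_per_sec * ELECTRONS_PER_MOLECULE * FARADAY_C_PER_MOLE * 1e6


def water_ug_per_hr(current_ua: float) -> float:
    """Inverse relation: water rate (micrograms per hour) from the measured current."""
    moles_per_sec = current_ua * 1e-6 / (ELECTRONS_PER_MOLECULE * FARADAY_C_PER_MOLE)
    return moles_per_sec * WATER_G_PER_MOLE * 3600.0 * 1e6


if __name__ == "__main__":
    for rate in (1.0, 5.0, 50.0):
        print(f"{rate:5.1f} ug/hr of water -> {current_microamps(rate):7.2f} microamperes")
```

Water rates of a few micrograms per hour therefore correspond to currents of the order of ten microamperes, which is consistent with the measurement precision being set by a microammeter.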

Several factors are involved in deciding on the size and form the sample
should have. Considerations of accuracy determine the size of the
sample; the larger the sample area the greater the rate of water 
penetration which, in turn, permits more accurate measurement. 

The apparatus should be capable of making measurements on tubu- 
lar samples such as cable sheath and air tubing, for example, but 
certain test materials are not available in quantities sufficient to ex- 
trude tubes and certain others (e.g., polyurethane) cannot be extruded 
easily so that it was necessary to make measurements on films of 
these materials. Obviously, two different methods must be used—one 
for tubes and one for films. 


III. TUBULAR APPARATUS 


Briefly, the tube apparatus (Fig. 1) consists of one or more tubes 
of plastic material submerged in a tank of water and connected at 
one end to the “moisture monitor.” Dry carrier gas is forced through the 
tube to sweep out water which has permeated the tube wall. The rate 
at which moisture is registered on the moisture monitor gives M in
μg/sec and from this and the geometry of the tube P is calculated.


3.1 Carrier Gas Supply 


The water is swept out of the plastic tube and into the moisture
monitor with an inert carrier gas and it is imperative, for accuracy,
to have the carrier gas enter the tube as dry as possible. The carrier gas
used has a water content of approximately three parts per million,
which is of the order of the water content of the gas in the tube. Thus,
it is necessary to remove even this small amount. This operation is
accomplished with an electrolytic drier cell as shown in the upper left
of Fig. 1. The “wet” carrier gas from the supply cylinder is passed
through the cell and the water is absorbed on the phosphorus pentoxide
and electrolyzed. This gives a carrier gas free of moisture.


3.2 End Seals


The ends of the plastic tubes are sealed into the apparatus with
“Swagelok” fittings. These fittings were originally designed for metal
tubing but they were adapted to plastic tubing by inserting a supporting
element inside the tubing. These end seals were thoroughly tested
and proved very effective as discussed below.

Fig. 1— Moisture permeability apparatus for tubular samples.

The seal consists of an anvil and chuck and two swaged ferrules 
supported by an insert inside the plastic tube. When the anvil and 
chuck are tightened together, the ferrules deform inward against the 
tubing which is supported by the insert. The ferrules also make a 
tight metal-to-metal seal against the chuck and anvil, respectively, 
so there is no path for vapor leakage through the fittings. 


3.3 Fittings and Other Equipment 


Stainless steel tubing is used in parts of the apparatus which could 
contain water to minimize sorption of water on the tubing walls. In 
other parts of the apparatus which contain only dry gas, copper tub- 
ing is used because it is much easier to fabricate. 

The outlet from the moisture monitor is connected to a flow meter to 
insure that the carrier gas is not flowing so rapidly as to “flood” the 
electrolytic cell and permit water to escape electrolysis. 


3.4 Temperature Control 


Temperature affects permeability and the temperature of the water 
must be controlled accurately. This is accomplished with a large, con- 
stant temperature heat sink consisting of a glycol bath cooled con- 
tinuously with a refrigerator. The water is circulated continuously 
(two gallons per minute) from the sample tank through coils in the 
cold glycol bath and then back into the tank. This gives a large heat 
sink and effectively isolates the sample tank temperature from room 
temperature variations. The temperature of the water is regulated by 
heating with a temperature controller. Temperature variations can be 
kept below 0.1°C at 30°C or above. The water in the tank is stirred 
to keep a uniform temperature throughout. A ten gallon per minute 
pump, located outside the tank, pumps water from the tank and 
directly back to the tank through a tube with holes. This effectively 
stirs and distributes the water and keeps it at a uniform temperature. 


3.5 Measurements 

To calculate P from the measurements on tubes, (4) must be put in
cylindrical coordinates. This form of the equation has been given
previously®

$$P = \frac{M\,\ln(D_o/D_i)}{2\pi\,(p_o - p_i)\,L},$$

where L is the sample tube length and D_i and D_o are inside and outside
tube diameters; p_i and p_o are water vapor pressures inside and
outside the tube. Because the dry carrier gas sweeps water from inside
the tube, p_i is small and can be neglected in comparison to p_o.
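
A short calculation shows the size of the rates involved. The sketch below uses the cylindrical relation P = M ln(D_o/D_i) / [2π(p_o − p_i)L] with p_i neglected. The permeability of 2 × 10⁻⁹ scc/sec-cm-cm Hg, the saturation vapor pressure of about 1.98 cm Hg at 22°C (a handbook value), the reading of 2.5 cm as the inside diameter, and all function names are assumptions made for illustration.

```python
import math

SCC_PER_MOLE = 22414.0      # cm^3 of ideal gas per mole at standard conditions
WATER_G_PER_MOLE = 18.015   # molar mass of water


def tube_rate_scc_per_sec(p_perm, length_cm, d_out_cm, d_in_cm, dp_cm_hg):
    """Water transfer rate M through a tube wall, in scc/sec."""
    return 2.0 * math.pi * length_cm * p_perm * dp_cm_hg / math.log(d_out_cm / d_in_cm)


def tube_permeability(rate_scc_per_sec, length_cm, d_out_cm, d_in_cm, dp_cm_hg):
    """Permeability P in scc/sec-cm-cm Hg from a measured rate M (inverse of the above)."""
    return rate_scc_per_sec * math.log(d_out_cm / d_in_cm) / (2.0 * math.pi * dp_cm_hg * length_cm)


def scc_per_sec_to_ug_per_hr(rate_scc_per_sec):
    """Convert a water vapor flow from scc/sec to micrograms per hour."""
    return rate_scc_per_sec / SCC_PER_MOLE * WATER_G_PER_MOLE * 1e6 * 3600.0


if __name__ == "__main__":
    d_in, wall = 2.5, 0.2  # cm; assumed inside diameter and wall thickness
    rate = tube_rate_scc_per_sec(2e-9, 10.0, d_in + 2.0 * wall, d_in, 1.98)
    print(f"M = {rate:.3e} scc/sec = {scc_per_sec_to_ug_per_hr(rate):.2f} ug/hr")
```

With these assumed values a 10-cm tube passes only a few micrograms of water per hour, which is why a very sensitive water detector is required.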

The inside and outside diameters required care in measurement because
the calculated values of P are sensitive to errors in D_i and D_o
of the tube samples. These diameters were measured accurately in
the case of clear plastic by means of water displacement. A measured
length of the sample tube L (usually one meter) was filled with an
accurately measured volume V of distilled water. Obviously,

$$V = \frac{\pi D_i^2 L}{4}$$

or

$$D_i = \sqrt{4V/\pi L}.$$

D_i can be calculated from the volume of water V contained in the
length of tube, L. The thickness of the tube wall was measured by
encapsulating a short segment of tube in epoxy resin and polishing the
mount down until the tube cross section was exposed. The wall thickness
was then measured with a stage micrometer. D_o can be obtained
from D_i and the wall thickness. The outside diameters of opaque tubes
are measured by inserting a sample tube (sealed at one end) into a
graduated glass column and measuring the volume ΔV of water displaced
by a given tube length, L. The outside diameter, D_o, can then
be calculated from

$$D_o = \sqrt{4\,\Delta V/\pi L}.$$

The inside diameter is D_i = D_o − 2 × wall thickness.
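
The diameter determinations reduce to a few lines of arithmetic. The sketch below implements D = (4V/πL)^{1/2} for a filled or displaced volume and the wall-thickness corrections; the function names and the sample numbers are hypothetical.

```python
import math


def diameter_from_volume(volume_cm3: float, length_cm: float) -> float:
    """Diameter of a cylinder of given length holding (or displacing) the given volume."""
    return math.sqrt(4.0 * volume_cm3 / (math.pi * length_cm))


def outside_from_inside(d_in_cm: float, wall_cm: float) -> float:
    """Clear tubes: D_o from the filled-volume D_i and the measured wall thickness."""
    return d_in_cm + 2.0 * wall_cm


def inside_from_outside(d_out_cm: float, wall_cm: float) -> float:
    """Opaque tubes: D_i from the displacement D_o and the measured wall thickness."""
    return d_out_cm - 2.0 * wall_cm


if __name__ == "__main__":
    # Hypothetical clear tube: one meter long, filled with 346.4 cm^3 of water,
    # wall thickness 0.2 cm from the stage micrometer.
    d_in = diameter_from_volume(346.4, 100.0)
    d_out = outside_from_inside(d_in, 0.2)
    print(f"D_i = {d_in:.3f} cm, D_o = {d_out:.3f} cm, ln(D_o/D_i) = {math.log(d_out / d_in):.4f}")
```

Since P depends on ln(D_o/D_i), which is small for a thin wall, modest errors in either diameter produce a much larger relative error in the result.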


3.6 Preliminary Measurements and Tests of Apparatus 


Initial measurements were made to prove the feasibility of the ap- 
paratus. It was necessary to check the seals at each end of the tubes 
to assure that they were watertight. This was done as follows: The 
permeability was measured for samples of different lengths (67.95 
cm, 113.64 cm, and 154.31 cm) of the same material. If the end seals 
are secure, there should be no difference in permeability for these 
three samples because P is a property of the plastic and should not 
depend on the experiment. Fig. 2 gives the results of this test and the 
good agreement in P for three samples indicates the integrity of the 
end seals. These tests were made on low density (ρ = 0.917) polyethylene
tubing supplied by Hydrawlik, Inc., Roselle, New Jersey.

Fig. 2— Permeability as a function of temperature for various lengths of sample.


IV. FILM APPARATUS 


The film apparatus (Fig. 3) consists essentially of two aluminum 
discs; one on each side of the circular test film. The two discs are 
identical so that it is necessary to describe only one of them. Two 
cavities are machined in each disc: one of these is covered with an 
aluminum plate to form a reservoir for temperature controlled water; 
the other cavity is adjacent to the test film when the apparatus is as- 
sembled. The test film is clamped between the two discs with 6 bolts 
(see Fig. 3). Water, which is to permeate the test film, is put in the 
cavity immediately below the film; electrolytically dried N₂ is passed
over the film to expel the permeated water. The “wet” gas is then 
run through the moisture monitor and the rate of water permeation 
is measured. From this permeation rate, and the thickness and the 
diameter of the film, the permeability can be calculated. 

The diameter of the film exposed to moisture transfer is 5.75 inches. 
Several considerations influence this dimension of the cell. The first is 
precision of measurement of water vapor permeation rate. If the diame- 
ter of the film is too small, the rate at which water permeates the 
film will be low and the moisture monitor will be unable to measure 
accurately the permeation rate. On the other hand, if the diameter of 
the cell is large (greater than 8 or 10 inches), it becomes difficult to
make films of uniform thickness. Moreover, large diameter discs are
difficult to machine accurately and hard to handle in assembling the
cell. The choice of diameter is a compromise between these factors.
Consideration of the sensitivity of the moisture monitor and the
other factors noted above indicated a 5 to 6-inch diameter film would
be optimum. A film of this diameter can be pressed to the necessary
uniformity and a cell of this diameter would not be difficult to machine
or handle in the laboratory.

Fig. 3 — Cut-away view of film cell.

The films were made by compression molding and their thickness 
was measured in two ways. First, the thickness was measured at 
several points with a micrometer and averaged. In the second method, 
a circle, 64 inches in diameter, was cut from the test film and weighed 
on an analytical balance. The density of this material was then measured
with density gradient columns. The volume of this circular
sample is then given by the weight, $w$, of the 64-inch
circle divided by the density, $\rho$, of the material,

$$\mathrm{Vol} = \frac{w}{\rho},$$

and this volume is also given by the relation

$$\mathrm{Vol} = \frac{\pi}{4} D^2 t$$

(where $D$ is the diameter of the circle and $t$ its thickness), so that by combining
these two equations and eliminating the volume, the thickness of the
sample can be calculated. In one case, the first method (micrometer) 
gave an average thickness of 11.9 mils, and the second method a thick- 
ness of 11.85 mils. This agreement indicates that the micrometer 
measurements, which are uncomplicated and much more convenient, 
would be as reliable and accurate as necessary. 
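
Eliminating the volume between the two relations gives $t = 4w/(\pi \rho D^2)$. A minimal Python sketch of this weighing method follows; the function names and the unit conversion helper are assumptions introduced for illustration.

```python
import math

CM_PER_MIL = 2.54e-3  # 1 mil = 0.001 inch = 0.00254 cm

def film_thickness_cm(weight_g, density_g_cm3, diameter_cm):
    """Thickness from weight: t = 4 w / (pi rho D^2).

    Follows from Vol = w / rho and Vol = (pi / 4) D^2 t.
    """
    return 4.0 * weight_g / (math.pi * density_g_cm3 * diameter_cm ** 2)

def cm_to_mils(length_cm):
    """Convert a length in centimeters to mils."""
    return length_cm / CM_PER_MIL
```

The result of `film_thickness_cm` can be passed through `cm_to_mils` for direct comparison with the micrometer readings.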


4.1 Tests of Film Apparatus 


Tests were made to determine the effect of film thickness on meas- 
ured permeability. If the films are extremely thin, say 1 mil or so, 
surface imperfections such as pits could contribute substantially to 
the total moisture transfer rate through the film. These tests were 
carried out with two materials—a sample of low-density polyethylene 
(10- and 12-mil films) and a sample of high density polyethylene (7- 
and 13-mil films). The comparison of the two film thicknesses for each 
material respectively is shown in Fig. 4 and indicates that the per- 
meability is not dependent on film thickness in the range of film thick- 
nesses (9 to 12 mils) used in these experiments. 


4.2 Additional Tests of Apparatus—Comparison of Films and Tubes 


In Fig. 5 the measured permeabilities for tubes of the polyethylene
materials are compared with the data obtained on films. The permeability
of the polyethylene films is about 4 percent lower than for
tubes of the same material, while the permeability of the PE-Butyl 
copolymer film is about 10 to 12 percent lower than that for the 
tube of this material. These differences are probably due to the dif- 
ferences in processing since the tubes were extruded and essentially 
quenched, while the films were compression molded and cooled more 
slowly (about 5°C per minute). The slower cooling anneals the films 
and they become more highly ordered. Because the permeability of 
the highly ordered regions is less than the amorphous portion, the 
permeability of the entire film would be less. 

Fig. 4 — Measured permeability is not a function of film thickness.


V. RESULTS AND DISCUSSION 


5.1 A Few Words About Organization 


The discussion will be divided into two categories: the effects of 
chemical composition and of physical parameters. This is a historical
classification rather than a logical one. In this study it was
first desirable to measure WVP’s of cable sheathing materials, both
those in use and proposed. These were all basically polyethylenes but
some contained varying contents of comonomers, viz. vinyl acetate, ethyl
acrylate, and acrylic acid. Polymers of other chemical types (polyester,
polyvinyl chloride, et al.) are also of interest because they, as sealing
and plugging materials, help bear the brunt of nature’s attack on the 
outside plant segment of the Bell System. 

As a consequence of this work, it was found that physical parameters
such as porosity and other heterogeneities influence WVP; these
will be discussed last.


5.2 Effect of Chemical Nature 


Fig. 6 contains the WVP’s of representative samples of polyolefins 
to give an overall picture of these materials and to orient the reader. 
The point to be noticed in Fig. 6 is the greater WVP’s of low-density 
polyethylene materials compared to those of high density. The ratio
of WVP’s of high- to low-density polyethylene ranges from 1/3 to 1/6.

The water vapor permeability of low-density polyethylene has been 
reported earlier**® as 2 X 10-8, whereas the values found here for 0.92 
density polyethylene were on the order of 10-°. This difference is most 
probably due to the differences in the types of polyethylene used and 
the differences in the thickness of the film samples used. In the previ- 
ous measurements, the sample films were on the order of 1 to 2 mils so 
that surface irregularities such as pits could contribute substantially 
to the overall moisture transfer rate across the film. In the present 
measurements, the sample films were on the order of 10 to 12 mils 
thick, and at these thicknesses surface irregularities, if they are pres- 
ent, would not contribute substantially to the moisture transfer rate. 

The WVP’s of cable sheathing materials (shown in Fig. 7) were 
of the order of 10⁻⁸ scc/sec-cm-cm Hg. The homopolymers, in general,
have lower WVP’s than either copolymers or materials containing 
low molecular weight additives. This greater WVP is probably due to 
several factors. Copolymers and additives, even in small amounts, can 
affect molecular and morphological factors such as branching in the 
polymer chain, molecular weight, and crystallinity among others. 
Because all these factors influence permeability, it is not surprising
that copolymers and other additives should cause an increase.

Fig. 5 — Permeabilities of films are slightly lower than tubes.

The WVP of polypropylene is intermediate to that of high- and low- 
density polyethylenes (Fig. 8). This is of interest for two reasons, one 
of them of practical interest, the other academic. In the first place, 
polypropylene and low-density polyethylene compete for use in some 
protective applications. If other factors are equal, advantage should 
be taken of this lower permeability of polypropylene. 

A second point of interest in the findings of polypropylene perme- 
ability concerns the effect of density. In the case of polyethylenes, 
increasing the density decreases the permeability. However, this ob- 
viously does not hold true for polyolefins as a class because poly- 
propylene has a density of 0.91 and yet has a lower permeability than 
low-density (0.92) polyethylene. The lower permeability of poly- 
propylene can be attributed in part to its high crystallinity (about 80 
percent). Crystalline regions are much less permeable*®* than the 
noncrystalline regions, hence, the material as a whole has a lower 
permeability. Moreover, it would be expected that the intermolecular 
friction would be higher in the case of polypropylene which would, in 
turn, decrease the diffusion rates through the amorphous regions of 
the polymer. 


Fig. 6 — Comparison of high-density with low-density polyethylene. (Plot of permeability, scc/sec-cm-cm Hg, against 10³/T, °K⁻¹, with a temperature scale in degrees centigrade; curves for low-density and high-density PE.)

Fig. 7 — Typical cable sheathing materials.

Fig. 8 — Comparison of polypropylene with typical high- and low-density polyethylenes.

Finally, the measurements of permeability of polypropylene bring
home another point. In searching for materials for use in environ- 
mental protection, we would like to find an inexpensive material with 
an extremely low water vapor permeability, and if possible, we would 
like to say that this material is impermeable to water vapor. We 
must be brought up short in this search, however, in view of the find- 
ings in the case of polypropylene. Here is a material that is about 80 
percent crystalline so that only 20 percent of the polypropylene con- 
tributes to moisture transfer, and even this 20 percent gives the 
polymer an overall permeability greater than some of the higher 
density polyethylenes. It is not likely that we could obtain a flexible 
usable olefinic material with any higher degree of crystallinity, and 
in this direction polypropylene represents a limit in lowering the water 
vapor permeability of materials by increasing crystallinity. 


5.3 Copolymers 


Interest in copolymers of olefins stems from two areas. In the first 
place, with even small percentages of comonomers such as vinyl acetate
or ethyl acrylate the polymer is much more flexible and less susceptible
to mechanical failure from stress. In the second place, these
copolymers are used to compound semiconducting materials by load- 
ing them with up to 40 percent carbon black. 

These copolymer materials have higher permeabilities than the 
straight polyethylene homopolymers. The permeabilities of the acrylic 
acid copolymers are shown in Fig. 9 as a function of acrylic acid copolymer
content. These data indicate that the permeability increases with
added acrylic acid in the polymer chain. The increase in permeability
with the addition of these comonomers is due to morphological factors:
the comonomer interrupts the regularity of the polymer chain
and thereby reduces the crystalline content of the polymer and increases
the amorphous content. The amorphous, disordered regions are
more permeable; hence, the permeability of the material is increased.
In the case of ethyl acrylate copolymers, the permeability increased by
almost a factor of ten with incorporation of 15 percent copolymer.

Fig. 9 — Increasing acrylic acid comonomer content increases the permeability of polyethylene. (Plot of permeability × 10⁹, scc/sec-cm-cm Hg, against percent acrylic acid.)


5.4 Oxidized Polymers 


The oxidized materials were made by atmospheric oxidation of 
unstabilized homopolymer. After oxidation and before the film sam- 
ples were made the materials were stabilized with Santonox. 

The permeability of these materials (Fig. 10) indicates that the introduction
of polar groups, such as carbonyl groups in this case, does
not necessarily cause an increase in permeability. Other factors can
play a part. Table I compares permeability, density, and carbonyl 
content of these three materials. 

As the carbonyl content goes up the density also increases, and the 
permeability decreases. Winslow’ has shown that this increase in den- 
sity is due to an increase in molecular order in the polymer and be- 
cause the more highly ordered regions are less permeable, the denser 
material would have the lower permeability. This reduction in permeability
with increased carbonyl content is of interest in cable sheathing
applications for obvious reasons and more materials of this type
will be obtained to investigate this effect in more detail.

Fig. 10 — Increasing oxidation decreases permeability of polyethylene.

TABLE I — EFFECT OF OXIDATION ON DENSITY AND PERMEABILITY AT 25°C

Percent carbonyl   Density   Permeability
0                  0.944     2.20 × 10⁻⁹
0.3                0.956     1.57 × 10⁻⁹
0.6                0.964     0.82 × 10⁻⁹


5.5 Nonolefinic Polymers 


The results on polyester are compared with low-density polyethylene
in Fig. 11. The permeability of polyester films was reported* several
years ago as 1.3 × 10⁻⁸ at 25°C. The value measured with the present
apparatus on polyester commercially available today was 2.3 × 10⁻⁸ at
25°C. The older sample of polyester was unplasticized whereas that
used in the present study was plasticized slightly, and because of
these differences a direct comparison is not entirely valid.

Fig. 11 — Comparison of nonolefinic polymers with a typical low-density polyethylene.

The permeabilities of plasticized poly(vinyl chloride) and poly- 
urethane are also shown in Fig. 11; both are over an order of magni- 
tude greater than typical low density polyethylene. 


VI. EFFECT OF PHYSICAL PARAMETERS 


One is not likely to find pure, homogeneous plastic materials used 
in the telephone plant. In addition to the deliberately added and necessary
heterogeneities such as carbon black, these materials have inadvertent
imperfections such as pores or solid particles. As an
example, a cross section of polyurethane cable plugging compound is
shown in Fig. 12. The black areas represent pores formed in polymerization
and indicate a porosity of up to 20 percent by volume although,
superficially, the material usually appears to be homogeneous.
Another example of porous material is the foamed polyethylene dielectric
used in some coaxial cables.


6.1 Porosity 

The effect of pores on permeability might not be straightforward:
permeability involves both diffusion and solubility, and although diffusion
would be expected to increase with porosity, solubility decreases
because water is more soluble in the polymers of interest than in the air
or gas in the pore. First, the effect of pores on P will be calculated from
a model for transport properties of heterogeneous, two-phase materials.
The calculated values will then be compared with measurements of the effect in
foamed silicone rubber.

Fig. 12 — Photomicrograph (50×) showing pores in polyurethane.

If we can calculate the effect of pores on diffusion coefficient D and 
on solubility S, we can calculate the effect on P because 

$$P(V) = D(V)\,S(V), \tag{5}$$

where V is the “porosity,” or volume fraction of pores.

In calculating D(V), use is made of models developed originally by 
Maxwell® in connection with the electrical transport properties of 
heterogeneous materials. With the analogies between Ohm’s law and 
Fick’s law, Maxwell’s model can be used to calculate the diffusion 
coefficient of plastics containing small amounts of gas-filled cavities 
or pores. From Maxwell’s work, the diffusion coefficient of a porous 
polymer is given by 


$$D = \frac{D_p\,[D_a + 2D_p - 2V(D_p - D_a)]}{D_a + 2D_p + V(D_p - D_a)}, \tag{6}$$

where $D_p$ and $D_a$ are, respectively, the diffusion coefficients of the
penetrant in polymer and in air (or the gas in the pore). It should be
understood that the model from which (6) is derived ignores the interactions
between adjacent pores so that for the case of porous plastics
the value of D would be somewhat low. For polyethylene

$$D_p \approx 10^{-8}\ \mathrm{cm^2/sec} \tag{7}$$

and for air

$$D_a \approx 10^{-1}\ \mathrm{cm^2/sec}. \tag{8}$$

Neglecting $D_p$ in comparison to $D_a$, Maxwell’s relation reduces to

$$D(V) = D_p\,\frac{1 + 2V}{1 - V}. \tag{9}$$

The solubility S(V) decreases linearly with V:

$$S(V) = (S_a - S_p)V + S_p, \tag{10}$$


where $S_a$ is the solubility of the penetrant in the pore or in the gas within the
pore and $S_p$ is the solubility of the penetrant in the polymer.
Putting these last two equations into the definition for P gives

$$P(V) = [(S_a - S_p)V + S_p]\left[D_p\,\frac{1 + 2V}{1 - V}\right]. \tag{11}$$

For the case of silicone rubber foam $S_p \gg S_a$ and (11) becomes

$$P(V) = [(1 - V)S_p]\left[D_p\,\frac{1 + 2V}{1 - V}\right]. \tag{12}$$

The factors (1 − V) cancel so that P is linear in V and a plot of
$P(V)/D_pS_p$ vs V will have a slope of 2 and an intercept (V = 0) of unity.
Such a plot is shown in Fig. 13 for foamed silicone rubber and as 
anticipated the measured WVP’s are somewhat greater than those 
calculated from the model. 
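
The model calculation can be reproduced numerically. The Python sketch below evaluates Maxwell’s relation, its limiting form for a pore gas that diffuses far faster than the polymer, and the resulting relative permeability; the function names and the numerical values in the check are illustrative assumptions, not measured data.

```python
def maxwell_diffusion(v, d_p, d_a):
    """Maxwell's relation for a polymer with volume fraction v of pores.

    d_p: diffusion coefficient in the polymer; d_a: in the pore gas.
    """
    numerator = d_a + 2.0 * d_p - 2.0 * v * (d_p - d_a)
    denominator = d_a + 2.0 * d_p + v * (d_p - d_a)
    return d_p * numerator / denominator

def diffusion_limit(v, d_p):
    """Limiting form for d_a >> d_p: D(V) = D_p (1 + 2V) / (1 - V)."""
    return d_p * (1.0 + 2.0 * v) / (1.0 - v)

def solubility(v, s_p, s_a=0.0):
    """Linear mixing rule: S(V) = (S_a - S_p) V + S_p."""
    return (s_a - s_p) * v + s_p

def relative_permeability(v):
    """P(V) / (D_p S_p) when S_p >> S_a; equals 1 + 2V."""
    return diffusion_limit(v, 1.0) * solubility(v, 1.0)

# Illustrative check at 20 percent porosity with d_a much larger than d_p.
print(maxwell_diffusion(0.2, 1.0, 1.0e7), diffusion_limit(0.2, 1.0))
print(relative_permeability(0.2))
```

Plotting `relative_permeability` against porosity gives the straight line of slope 2 and unit intercept described above.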

As Maxwell noted: 


“When the distance between the [pores] is not great compared with 
their radii . . . other terms enter into the result, which we shall not 
now consider.” 


For the time being, we will follow his 70 year old cue and use the 
result in a qualitative manner only. Pore interaction in moderately and 
in highly foamed materials and its effect on diffusion is the subject 
of continuing research and will be reported on in a subsequent paper. 


6.2 Effect of Carbon Black


Two ranges of carbon black content were investigated: those
used in cable sheath materials (about 2.5 percent by weight) and the
higher contents (up to 40 percent) proposed for semiconducting sheathing
materials. Fig. 14 shows some representative data for materials
containing about 2.5 percent carbon black compared with the natural
materials. The carbon black at these contents lowered the permeability
only slightly. Note also that, at these low percentages, the temperature
dependences of the filled and unfilled materials are essentially identical.

Fig. 13 — Measured foam WVP is higher than calculated.

Materials highly loaded with carbon black also showed a linear 
decrease in permeability with increasing carbon black content. Fig. 15 
gives some representative data. Qualitatively, this decrease in per- 
meability with increasing carbon black content is not difficult to ex- 
plain. The carbon black particles are most probably impermeable to 
moisture; hence, their presence in the polymer decreases the volume 
available to moisture diffusion. Again, we can use Maxwell’s model 
discussed earlier to describe this decrease in permeability with increasing
carbon black content. For the case of a polymer interspersed with
impermeable particles, Maxwell’s equation for the diffusion coefficient
is given by

$$D = \frac{2D_p(1 - V_{cb})}{2 + V_{cb}}, \tag{13}$$

where $D_p$ is the diffusion coefficient of water vapor in the polymer and
$V_{cb}$ is the volume fraction of carbon black.

The solubility of the loaded polymers decreases linearly with carbon
black content:

$$S(V_{cb}) = (1 - V_{cb})S_p. \tag{14}$$

Combining the equations for D and S with the definition of P gives

$$P(V_{cb}) = \frac{2D_pS_p(1 - V_{cb})^2}{2 + V_{cb}}. \tag{15}$$

This equation is shown as a dotted line in Fig. 15 and is slightly
lower than the experimental values for WVP. In the equation for
$S(V_{cb})$ above, it was assumed that the solubility of water on the carbon
black was negligible, but as shown below there is some interaction
and, to be precise, we would be justified in assigning some contribution
to $S(V_{cb})$ due to the carbon black. This would increase the calculated
values of P to agree more closely with the data.

Fig. 14 — Effect of 2.5 percent carbon black.

Fig. 15 — Effect of carbon black on permeability of poly(ethylene ethyl acrylate) copolymer.
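
The filled-polymer model can be evaluated in the same way as the pore model. The following Python sketch is illustrative; the function names are assumptions, and the inputs are the polymer values $D_p$ and $S_p$ together with the carbon black volume fraction.

```python
def filled_diffusion(v_cb, d_p):
    """Maxwell's result for impermeable particles: D = 2 D_p (1 - V) / (2 + V)."""
    return 2.0 * d_p * (1.0 - v_cb) / (2.0 + v_cb)

def filled_solubility(v_cb, s_p):
    """Solubility with non-sorbing filler: S = (1 - V) S_p."""
    return (1.0 - v_cb) * s_p

def filled_permeability(v_cb, d_p, s_p):
    """P = D S = 2 D_p S_p (1 - V)^2 / (2 + V)."""
    return filled_diffusion(v_cb, d_p) * filled_solubility(v_cb, s_p)

# Relative permeability P / (D_p S_p) at several illustrative volume fractions.
for v in (0.0, 0.1, 0.2, 0.3):
    print(v, filled_permeability(v, 1.0, 1.0))
```

Assigning the carbon black a small solubility of its own, as suggested above, would amount to replacing `filled_solubility` with a mixing rule that has a nonzero filler term.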

At higher carbon black contents (10 percent and above) the temperature
dependence of filled and unfilled materials is markedly
different. Fig. 16 gives an example of this behavior and shows that the
activation energy for permeation, $E_P$, is decreased by carbon black.
The decrease in $E_P$ for the case of water sorption in polyethylene
highly loaded with carbon black is not unexpected. For the water-polyethylene
system,

$$D = D_0 \exp(-E_D/RT), \tag{16}$$

$$S = S_0 \exp(-\Delta H/RT), \tag{17}$$

and

$$P = P_0 \exp(-E_P/RT), \tag{18}$$

where $D_0$, $S_0$, and $P_0$ are constants; $E_D$ and $E_P$ are activation energies
for diffusion and permeation, respectively; and $\Delta H$ is the heat of sorption.
Substituting these equations into the definition of P (4) gives

$$P_0 \exp(-E_P/RT) = D_0S_0 \exp[-(E_D + \Delta H)/RT]. \tag{19}$$

From this

$$E_P = E_D + \Delta H. \tag{20}$$

For the case of water sorption on carbon black, $\Delta H$ is negative, and
increasing the carbon black to 41 percent gives a greater negative contribution;
$E_P$ is therefore decreased.

Fig. 16 — Effect of adding 41 percent carbon black to poly(ethylene ethyl acrylate) copolymer.
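
The additivity of the activation energies can be illustrated numerically. In the Python sketch below, the pre-exponential factors, energies, and temperatures are hypothetical values chosen only to exercise the relations; the function names are likewise assumptions.

```python
import math

R_GAS = 1.987e-3  # gas constant, kcal/(mol K)

def arrhenius(prefactor, energy_kcal, temp_k):
    """Generic Arrhenius factor: prefactor * exp(-E / RT)."""
    return prefactor * math.exp(-energy_kcal / (R_GAS * temp_k))

def permeability(d0, s0, e_d, delta_h, temp_k):
    """P = D S, with D and S each following an Arrhenius law."""
    return arrhenius(d0, e_d, temp_k) * arrhenius(s0, delta_h, temp_k)

def apparent_activation_energy(p1, t1, p2, t2):
    """E_P from two permeabilities: -R * d(ln P) / d(1/T)."""
    return -R_GAS * math.log(p2 / p1) / (1.0 / t2 - 1.0 / t1)

# Hypothetical inputs: E_D = 10 kcal/mol and a negative heat of sorption.
p_cool = permeability(2.0, 3.0, 10.0, -4.0, 295.0)
p_warm = permeability(2.0, 3.0, 10.0, -4.0, 323.0)
print(apparent_activation_energy(p_cool, 295.0, p_warm, 323.0))
```

The recovered slope equals the sum of the diffusion energy and the heat of sorption, so a more negative heat of sorption lowers the apparent activation energy for permeation.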
VII. SUMMARY


In general, the low-density polyethylene sheathing materials have 
permeabilities on the order of 10⁻⁸ scc/sec-cm-cm Hg at 22°C. High-
density polyethylenes have permeabilities from 1/3 to 1/6 that of low- 
density polyethylene. The permeability of polypropylene is intermedi- 
ate to high- and low-density polyethylenes. Copolymers of polyethy- 
lene (for example, vinyl acetate, ethyl acrylate, and acrylic acid) have 
higher permeabilities than the homopolymer; in one case 15 percent
ethyl acrylate increased the permeability by a factor of 10.

Nonolefinic polymers, in general, have higher permeabilities; for 
example, polyurethanes have permeabilities more than 80 times higher 
than low-density polyethylene. 

Heterogeneities in the plastic such as carbon black and pores in- 
fluence permeability. Addition of carbon black decreases the WVP 
roughly in proportion to the amount of carbon black. The permeability 
of these materials increases with increasing porosity. 
Statistical Analysis of the Level Crossings
and Duration of Fades of the Signal 
from an Energy Density Mobile 


Radio Antenna 


By W. C.-Y. LEE 
(Manuscript received September 30, 1966) 


A theoretical analysis of signal fading using an energy density antenna
is developed and compared with that from an isotropic antenna. The
energy density antenna provides a signal proportional to the energy density
of the mobile radio field. The number of crossings that the signal makes of
a given signal level and the average duration of fades below a given signal
level have been derived theoretically for these two cases using a simple statistical
model. Comparing the number of level crossings of the electric field
with that of the energy density, it is shown that the energy density fades less
frequently than the electric field by at least a factor of two. The average
duration of fades of the electric field is greater than that of the energy density
only for lower signal levels. These results are in reasonable agreement with
experimental measurements.


I. INTRODUCTION 


The study of signal fading appears to be very important to mobile 
radio systems. When a steady sine wave is sent out from a fixed station, 
the signal received by a mobile receiver in motion fluctuates, or, in 
radio jargon, fades. The received signal fluctuates more rapidly as both 
the frequency of the transmitted wave and the speed of the mobile 
radio increase. For a field received by a moving isotropic antenna, the
maximum fading frequency $f_m$, as Ossanna’ has pointed out, is $f_m = 2V/\lambda$,
where $V$ is the speed of the mobile radio and $\lambda$ is the wavelength
of the steady sine wave. For instance, at 836 MHz and a speed of 15
miles/hr, the signal fades at a rate of about 40 times every second and
is a serious disturbance to the mobile radio communication.
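
The quoted fading rate follows directly from the 2V/λ relation. The short Python sketch below reproduces the arithmetic; the function name and constants are introduced here for illustration.

```python
SPEED_OF_LIGHT = 299_792_458.0  # m/s
MPH_TO_MPS = 0.44704            # 1 mile/hr in m/s

def max_fading_frequency(speed_mps, carrier_hz):
    """Maximum fading frequency 2V / lambda for a moving isotropic antenna."""
    wavelength = SPEED_OF_LIGHT / carrier_hz
    return 2.0 * speed_mps / wavelength

rate = max_fading_frequency(15.0 * MPH_TO_MPS, 836.0e6)
print(f"{rate:.1f} fades per second")
```

At 836 MHz and 15 miles/hr this evaluates to a little over 37 per second, consistent with the round figure of about 40 given above.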

There have been many investigations of the fading problem. Aikens 
and Lacy” made a test using 450-MHz transmission to a mobile receiver
in an urban area. Bullington’ investigated radio propagation variation
at VHF and UHF. Young’ pointed out that for the test samples of signal
strength taken over a small area, the amplitude follows a Rayleigh
distribution to a fair approximation. Also S. O. Rice’ pointed out that
the fluctuations of a received radio signal have the same behavior as 
the envelope of a narrow-band Gaussian noise. Recently, Ossanna’ 
measured the power spectra of a mobile radio fading signal. They all 
treated the signal as obtained from an isotropic antenna. 

In this paper, a theoretical analysis of fading using an energy density 
antenna’ is developed and compared with that from an isotropic antenna. 
The concept of using the energy density antenna to reduce the effect 
of signal fading was suggested by J. R. Pierce.” It will be discussed in 
detail later. The number of crossings $n(\Psi)$ that the signal makes of a
given signal level $\Psi$, and the average duration of fades $t(\Psi)$ below a
given signal level $\Psi$, have been derived theoretically from a statistical
model using Gaussian random amplitudes and equal angles of arrival 
of an infinite number of incoming waves. The two statistical factors, n 
and t, first expressed by Rice,* can describe the property of individual 
signal fading very well. In this paper, n and t for the isotropic antenna 
will be compared with the values for the energy density antenna. These 
theoretical results also will be compared with the experimental data. 


II. THE METHOD OF OBTAINING THE EXPECTED NUMBER OF LEVEL CROSS- 
INGS AND AVERAGE DURATION OF FADES 


From Kac’s’ and Rice’s® papers, a brief derivation of the expected
number of level crossings $n(\Psi)$ of a given signal level $\Psi$ and the average
duration of fades below a given signal level $\Psi$ is as follows. We assume a
random function $\psi$ which is statistically stationary in time, and for
which the joint probability density function of $\psi$ and its slope $\dot{\psi}$ is
$p(\psi, \dot{\psi})$. Any given slope $\dot{\psi}$ can be obtained by

$$\dot{\psi} = \frac{d\psi}{\tau}, \tag{1}$$

where $\tau$ is the time required for a change of ordinate $d\psi$, as shown in
Fig. 1. The expected number of crossings of a random function $\psi$ in
the interval $(\Psi, \Psi + d\psi)$ for a given slope $\dot{\psi}$ in time $dt$ is

$$\frac{\text{expected amount of time spent in the interval } d\psi \text{ for a given } \dot{\psi} \text{ in time } dt}{\text{time required to cross once for a given } \dot{\psi} \text{ in the interval } d\psi \text{ at } \psi = \Psi} = \frac{p(\Psi, \dot{\psi})\, d\psi\, d\dot{\psi}\, dt}{d\psi / \dot{\psi}} = \dot{\psi}\, p(\Psi, \dot{\psi})\, d\dot{\psi}\, dt. \tag{2}$$

The expected number of crossings for a given $\dot{\psi}$ in time $T$ is

$$\int_0^T \dot{\psi}\, p(\Psi, \dot{\psi})\, d\dot{\psi}\, dt = \dot{\psi}\, p(\Psi, \dot{\psi})\, d\dot{\psi}\, T. \tag{3}$$

The total expected number of upward crossings in time $T$ is

$$N(\Psi) = T \int_0^\infty \dot{\psi}\, p(\Psi, \dot{\psi})\, d\dot{\psi}. \tag{4}$$

The total expected number of crossings per second is

$$n(\Psi) = \frac{N(\Psi)}{T} = \int_0^\infty \dot{\psi}\, p(\Psi, \dot{\psi})\, d\dot{\psi}. \tag{5}$$

Since the expected number of crossings at a particular level $\Psi$ per second
can also be stated as

$$n(\Psi) = \frac{\text{expected amount of time where the function } \psi \text{ is below level } \Psi \text{ in one second}}{\text{average duration of fades below level } \Psi} = \frac{P(\psi \le \Psi)}{t(\Psi)}, \tag{6}$$

the average duration of fades below level $\Psi$ is

$$t(\Psi) = \frac{P(\psi \le \Psi)}{n(\Psi)}. \tag{7}$$

Hence, the results will be derived from the joint probability density
function $p(\psi, \dot{\psi})$, and the problem is to derive this probability density
function for the various signals.

Fig. 1 — The notation used in obtaining the expected number of level crossings $n(\Psi)$ and the average duration of fades $t(\Psi)$.
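
As a concrete illustration of (5) and (7), the Python sketch below evaluates both for an assumed joint density in which ψ and its slope are independent, zero-mean Gaussian variables. This density is chosen only because the integral in (5) then has a simple closed form to check against; it is not one of the densities derived in this paper, and all function names are assumptions.

```python
import math

def joint_density(psi, psi_dot, var, var_dot):
    """Assumed p(psi, psi_dot): independent zero-mean Gaussians."""
    exponent = -0.5 * (psi ** 2 / var + psi_dot ** 2 / var_dot)
    return math.exp(exponent) / (2.0 * math.pi * math.sqrt(var * var_dot))

def crossing_rate(level, var, var_dot, steps=20000):
    """Equation (5) by the midpoint rule: integral of psi_dot * p(level, psi_dot)."""
    upper = 12.0 * math.sqrt(var_dot)  # the integrand is negligible beyond this
    h = upper / steps
    total = 0.0
    for i in range(steps):
        slope = (i + 0.5) * h
        total += slope * joint_density(level, slope, var, var_dot)
    return total * h

def prob_below(level, var):
    """P(psi <= level) for a zero-mean Gaussian of variance var."""
    return 0.5 * (1.0 + math.erf(level / math.sqrt(2.0 * var)))

def average_fade_duration(level, var, var_dot):
    """Equation (7): t = P(psi <= level) / n(level)."""
    return prob_below(level, var) / crossing_rate(level, var, var_dot)

def crossing_rate_closed_form(level, var, var_dot):
    """Closed form of (5) for the assumed Gaussian density."""
    return math.sqrt(var_dot / var) * math.exp(-level ** 2 / (2.0 * var)) / (2.0 * math.pi)
```

For this assumed density the numerical integral and the closed form agree closely, which gives a simple check on an implementation before it is applied to the envelope densities derived in the following sections.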


III. THE EXPECTED NUMBER OF LEVEL CROSSINGS AND THE AVERAGE
DURATION OF FADES FOR A VERTICALLY POLARIZED WAVE


In order to obtain the expected number of level crossings of a given 
signal level R and the average duration of fades below a given signal 
level R for the three field components of a vertically polarized wave, 
first we need to specify the forms of the three field components. Then a 
statistical model of the field components is assumed. From such a 
model, we find the joint probability density functions of amplitude R 
and its slope $\dot{R}$ for the three field components. Finally, we use (5) and
(7) to obtain the result for each field component. 

Following Gilbert’’ a vertically polarized plane wave $E_z$ traveling
in a direction $\mathbf{u}$ in the (x, y) plane is assumed. The three field components
referenced to a receiver moving with velocity vector $\mathbf{V}$ can be
written

$$E_z = e_z = A_u \exp(-j\beta\,\mathbf{u}\cdot\mathbf{V}t)\exp(j\omega t) \quad \text{volt/m}$$

$$H_x = \eta\,(h_x\ \text{amp/m}) = A_u \sin\theta_u \exp(-j\beta\,\mathbf{u}\cdot\mathbf{V}t)\exp(j\omega t) \quad \text{volt/m}$$

$$H_y = \eta\,(h_y\ \text{amp/m}) = -A_u \cos\theta_u \exp(-j\beta\,\mathbf{u}\cdot\mathbf{V}t)\exp(j\omega t) \quad \text{volt/m},$$

where $\beta$ is the wave number and $A_u$ is the complex amplitude of an electric
wave propagating in the direction $\mathbf{u}$. $\mathbf{u}$ is a unit vector at an angle
$\theta_u$ measured from the positive x-axis. $\eta$ is the free-space
wave impedance. The time variation $\exp(j\omega t)$ can be dropped from the
three field components to simplify the derivation. Moreover, from
now on, we will treat all three components $E_z$, $H_x$, and
$H_y$ in units of volt/m, which will also simplify the calculation.

When N vertically polarized waves coming from N directions are received
by an isotropic antenna of the mobile radio, the three components
become

$$E_z = \sum_{u=1}^{N} A_u \exp(-j\beta\,\mathbf{u}_u\cdot\mathbf{V}t) = \sum_{u=1}^{N} A_u \exp[-j\beta Vt\cos(\theta_u - \alpha)] \tag{8}$$

$$H_x = \sum_{u=1}^{N} A_u \sin\theta_u \exp(-j\beta\,\mathbf{u}_u\cdot\mathbf{V}t) = \sum_{u=1}^{N} A_u \sin\theta_u \exp[-j\beta Vt\cos(\theta_u - \alpha)] \tag{9}$$

$$H_y = -\sum_{u=1}^{N} A_u \cos\theta_u \exp(-j\beta\,\mathbf{u}_u\cdot\mathbf{V}t) = -\sum_{u=1}^{N} A_u \cos\theta_u \exp[-j\beta Vt\cos(\theta_u - \alpha)], \tag{10}$$

where $\theta_u$ is the angle between the positive x-axis and the direction of the
uth wave $\mathbf{u}_u$, and $0 \le \theta_u \le 2\pi$. $\alpha$ is the angle between the x-axis and the
velocity $\mathbf{V}$, and $0 \le \alpha \le 2\pi$. Both $\theta_u$ and $\alpha$ are shown in Fig. 2.

In this paper, a statistical model is used as follows: The complex
amplitude $A_u$ can be separated into a real and an imaginary part, $A_u = R_u + jS_u$;
hence N incoming waves have N real values each of $R_u$ and $S_u$.
We suppose all those 2N real values are independent Gaussian variables
with mean zero and variance one. Also, we assume the N waves have
uniform angular distribution, i.e., the kth wave $\mathbf{u}_k$ has an angle of arrival
$\theta_k = 2\pi k/N$. Moreover, in this paper an infinite number of multiply
reflected waves ($N \to \infty$) are assumed for finding the expected number
of level crossings $n(R)$ of a given signal level $R$, and the average duration
$t(R)$ of fades below a given signal amplitude $R$.


3.1 Finding the Values of n(R) and t(R) from the $E_z$ Field


First of all, we need to obtain the joint probability density function
of signal amplitude $R$ and its slope $\dot{R}$ for the electric field component
$E_z$ using the statistical model we mentioned previously. We start from
(8). The alternate form of (8) can be written as

$$E_z = \sum_{u=1}^{N} (R_u + jS_u)\,[\cos\{\beta Vt\cos(\theta_u - \alpha)\} - j\sin\{\beta Vt\cos(\theta_u - \alpha)\}]. \tag{11}$$

Equation (11) can be separated into real and imaginary parts

$$E_z = X_1 + jY_1. \tag{12}$$

The real part of $E_z$ is

$$X_1 = \sum_{u=1}^{N} (R_u\cos\varphi_u + S_u\sin\varphi_u) \tag{13}$$

and the imaginary part of $E_z$ is

$$Y_1 = \sum_{u=1}^{N} (S_u\cos\varphi_u - R_u\sin\varphi_u), \tag{14}$$

where

$$\varphi_u = \beta Vt\cos(\theta_u - \alpha). \tag{15}$$

Fig. 2 — The coordinate system. (V = vehicle velocity; U = direction of propagation of a random field component.)

We assume that all N values of $R_u$ and $S_u$ in (13) and (14) are time independent.
Then the derivatives with respect to time of (13) and (14)
are

$$\dot{X}_1 = \beta V \sum_{u=1}^{N} (-R_u\sin\varphi_u + S_u\cos\varphi_u)\cos(\theta_u - \alpha) \tag{16}$$

$$\dot{Y}_1 = \beta V \sum_{u=1}^{N} (-S_u\sin\varphi_u - R_u\cos\varphi_u)\cos(\theta_u - \alpha). \tag{17}$$

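The mean values, variances, and covariances of $X_1$, $Y_1$, $\dot{X}_1$, and $\dot{Y}_1$ are

$$m = \langle X_1\rangle = \langle Y_1\rangle = \langle\dot{X}_1\rangle = \langle\dot{Y}_1\rangle = 0$$

$$\mu_{11} = \langle X_1^2\rangle = \langle Y_1^2\rangle = N \quad \text{for any } N \tag{18}$$

$$\mu'_{11} = \langle\dot{X}_1^2\rangle = \langle\dot{Y}_1^2\rangle = (\beta V)^2\,\frac{N}{2} \quad \text{for } N \ge 3 \tag{19}$$

and

$$\langle X_1Y_1\rangle = \langle X_1\dot{X}_1\rangle = \langle Y_1\dot{X}_1\rangle = \langle Y_1\dot{Y}_1\rangle = \langle X_1\dot{Y}_1\rangle = \langle\dot{X}_1\dot{Y}_1\rangle = 0.$$

The above results are shown in the Appendix.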

From the central limit theorem, it follows that $X_1$, $Y_1$, $\dot X_1$, and $\dot Y_1$ are four independent random variables which are normally distributed as N approaches infinity. The probability density function of the four independent real random variables $X_1$, $Y_1$, $\dot X_1$, and $\dot Y_1$ is

$$p(X_1, Y_1, \dot X_1, \dot Y_1) = \frac{1}{(2\pi)^2 |\mu|^{1/2}} \exp\left\{-\frac{1}{2}\left[\frac{X_1^2 + Y_1^2}{\mu_{11}} + \frac{\dot X_1^2 + \dot Y_1^2}{\mu'_{11}}\right]\right\}, \tag{20}$$

where the determinant $|\mu|$ of the covariance matrix is

$$|\mu| = (\mu_{11}\mu'_{11})^2 = \frac{N^4}{4}(\beta V)^4.$$

We may introduce the concept of the envelope

$$E_z = X_1 + jY_1 = r_e \angle \eta_e.$$

The quantity $r_e$ is the envelope and $\eta_e$ is the phase, both of which are slowly varying functions of time. Then,

$$X_1 = r_e \cos\eta_e, \qquad Y_1 = r_e \sin\eta_e \tag{21a}$$

$$\dot X_1 = \dot r_e \cos\eta_e - r_e \dot\eta_e \sin\eta_e, \qquad \dot Y_1 = \dot r_e \sin\eta_e + r_e \dot\eta_e \cos\eta_e. \tag{21b}$$

The Jacobian of the transformation from $(X_1, Y_1, \dot X_1, \dot Y_1)$-space to $(r_e, \eta_e, \dot r_e, \dot\eta_e)$-space is $|J| = r_e^2$.

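Therefore, the change of variables gives the probability density the form

$$p(r_e, \eta_e, \dot r_e, \dot\eta_e) = r_e^2\, q(r_e, \eta_e, \dot r_e, \dot\eta_e) = \frac{r_e^2}{(2\pi)^2 |\mu|^{1/2}} \exp\left\{-\frac{1}{2}\left[\frac{r_e^2}{\mu_{11}} + \frac{\dot r_e^2 + r_e^2 \dot\eta_e^2}{\mu'_{11}}\right]\right\}, \tag{22}$$

where $q(r_e, \eta_e, \dot r_e, \dot\eta_e)$ is the density obtained on substituting for $X_1$, $Y_1$, etc., their values in $r_e$, $\eta_e$, etc., obtained from (21a) and its time derivative (21b). To obtain $p(r_e, \dot r_e)$, the probability density of the envelope and its rate of change, we must integrate over $\eta_e$ and $\dot\eta_e$, the ranges of which are, respectively, (0 to $2\pi$) and ($-\infty$ to $+\infty$). From (22) we obtain

$$p(r_e, \dot r_e) = \frac{r_e}{\sqrt{2\pi\mu'_{11}}\,\mu_{11}} \exp\left\{-\frac{1}{2}\left(\frac{r_e^2}{\mu_{11}} + \frac{\dot r_e^2}{\mu'_{11}}\right)\right\}. \tag{23}$$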


It is observed that the expression on the right of (23) is independent of t. Hence, the expected number of level crossings $n(R_e)$ at a given signal amplitude ($r_e = R_e$) can be obtained from (5) by using $p(R_e, \dot r_e)$ in (23):

$$n(R_e) = \int_0^{\infty} \dot r_e\, p(R_e, \dot r_e)\, d\dot r_e = \sqrt{\frac{\mu'_{11}}{2\pi}}\;\frac{R_e}{\mu_{11}} \exp\left(-\frac{R_e^2}{2\mu_{11}}\right). \tag{24}$$

Now the variance of $r_e$ is

$$\langle r_e^2\rangle = \langle X_1^2\rangle + \langle Y_1^2\rangle = 2\mu_{11} = 2N.$$

Let the level normalized to the rms value be

$$\tilde R_e = \frac{R_e}{\sqrt{\langle r_e^2\rangle}} = \frac{R_e}{\sqrt{2\mu_{11}}} = \frac{R_e}{\sqrt{2N}}. \tag{25}$$


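Substituting the values of the variances $\mu_{11}$ and $\mu'_{11}$ from (18) and (19) into (24), and applying the relations in (25), we obtain

$$n(\tilde R_e) = \frac{\beta V}{\sqrt{2\pi}}\,\tilde R_e \exp(-\tilde R_e^2). \tag{26}$$

Equation (26) is plotted in Fig. 3, where the abscissa is the normalized level $\tilde R_e$ in dB ($20 \log \tilde R_e$) and the ordinate is $(\sqrt{2\pi}/\beta V)\,n(\tilde R_e)$.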
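As a quick numerical illustration of (26), the sketch below evaluates the normalized crossing rate (level relative to rms, times the exponential of minus its square) on a dB grid and locates the level with the most crossings. The function name and the grid are illustrative choices, not part of the original analysis.

```python
import math

def crossing_rate_e(level_db):
    """Normalized E-field crossing rate, (sqrt(2*pi)/(beta*V)) * n, per (26)."""
    r = 10.0 ** (level_db / 20.0)      # amplitude relative to rms (20 log R)
    return r * math.exp(-r * r)

# Scan -20 dB .. +10 dB in 0.01 dB steps and find the level with most crossings.
levels_db = [-20.0 + 0.01 * k for k in range(3001)]
peak_db = max(levels_db, key=crossing_rate_e)

print(f"peak near {peak_db:.2f} dB, rate at rms level = {crossing_rate_e(0.0):.4f}")
```

The peak falls at about -3 dB, that is, at 1/sqrt(2) of the rms value, and the value at the rms level is exp(-1), about 0.368.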
The average duration of fades $t(\tilde R_e)$ of $E_z$ can be obtained as follows: The probability that the envelope $r_e$ is less than a given amplitude level $R_e$ is

$$P(r_e(t) < R_e) = \int_0^{R_e} p(r_e(t))\, dr_e, \tag{27}$$
Fig. 3 - Comparison of the level crossing rate of the electric field with that of the energy density. [Plot omitted. Abscissa: signal level in dB with respect to rms value. Ordinate: the expected number of level crossings per second, normalized by $\beta V/\sqrt{2\pi}$. Curves: the expected number of level crossings of fades of the total energy density, and the expected number of level crossings of fades of the electric field.]
where $p(r_e(t))$ is the probability density obtained from $p(X_1, Y_1)$. By changing the variables, we obtain $p(r_e(t), \eta_e(t))$ from $p(X_1, Y_1)$. Then we integrate $p(r_e(t), \eta_e(t))$ over $\eta_e$ from 0 to $2\pi$ to obtain

$$p(r_e(t)) = \frac{r_e}{\mu_{11}} \exp\left[-\frac{r_e^2(t)}{2\mu_{11}}\right], \tag{28}$$

where $\mu_{11}$ is obtained from (18). Substituting (28) into (27) we get

$$P(r_e(t) < R_e) = 1 - \exp(-R_e^2/2\mu_{11}) = 1 - \exp(-\tilde R_e^2). \tag{29}$$


The expected number of times per second that $r_e(t)$ passes upward (or downward) across the level $R_e$ is obtained from (24). The average duration of fades during which $r_e(t) < R_e$ may then be obtained by substituting (24) and (29) into (7):

$$t(\tilde R_e) = \frac{P(r_e(t) < R_e)}{n(\tilde R_e)} = \frac{\sqrt{2\pi}}{\beta V}\,\frac{1}{\tilde R_e}\left[\exp(\tilde R_e^2) - 1\right], \tag{30}$$

which is shown in Fig. 4.
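The relation between (26), (29), and (30) can be checked numerically. The following is a minimal sketch with the common factor $\beta V/\sqrt{2\pi}$ divided out; the function names are hypothetical.

```python
import math

def crossing_rate_e(r):
    """(sqrt(2*pi)/(beta*V)) * n from (26); r is the level relative to rms."""
    return r * math.exp(-r * r)

def fade_duration_e(r):
    """(beta*V/sqrt(2*pi)) * t from (30)."""
    return math.expm1(r * r) / r

def prob_below(r):
    """Probability that the envelope is below the level, from (29)."""
    return -math.expm1(-r * r)

for level_db in (-20, -10, -3, 0, 3):
    r = 10.0 ** (level_db / 20.0)
    # rate * duration should reproduce the probability of being in a fade
    print(level_db, round(fade_duration_e(r), 4),
          round(crossing_rate_e(r) * fade_duration_e(r), 4),
          round(prob_below(r), 4))
```

The product of the crossing rate and the average fade duration equals the fraction of time spent below the level, which is the content of (7) as it is used here.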


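3.2 Finding the Values of $n(R_{hx})$ and $t(R_{hx})$ from the $H_x$ Field


Following the same steps as above, we first find the joint probability density function $p(r_{hx}, \dot r_{hx})$ of the envelope $r_{hx}$ and its slope $\dot r_{hx}$ for the $H_x$ field component. From (9) we find the real and imaginary parts of $H_x$, which are expressed in the Appendix. The means, variances, and covariances of the four real Gaussian random variables, shown in the Appendix, are

$$m_2 = \langle X_2\rangle = \langle Y_2\rangle = \langle \dot X_2\rangle = \langle \dot Y_2\rangle = 0,$$

$$\mu_{22} = \langle X_2^2\rangle = \langle Y_2^2\rangle = \frac{N}{2},$$

$$\mu'_{22} = \langle \dot X_2^2\rangle = \langle \dot Y_2^2\rangle = (\beta V)^2\,\frac{N}{8}\left[\cos^2\alpha + 3\sin^2\alpha\right] \quad \text{for } N = 3 \text{ and } N \ge 5,$$

$$\langle X_2 Y_2\rangle = \langle X_2 \dot X_2\rangle = \langle X_2 \dot Y_2\rangle = \langle Y_2 \dot X_2\rangle = \langle Y_2 \dot Y_2\rangle = \langle \dot X_2 \dot Y_2\rangle = 0.$$

The probability density of the envelope of the $H_x$ field and its rate of change, $p(r_{hx}, \dot r_{hx})$, is then obtained by following the same procedure used in deriving $p(r_e, \dot r_e)$:

$$p(r_{hx}, \dot r_{hx}) = \frac{r_{hx}}{\sqrt{2\pi\mu'_{22}}\,\mu_{22}} \exp\left\{-\frac{1}{2}\left(\frac{r_{hx}^2}{\mu_{22}} + \frac{\dot r_{hx}^2}{\mu'_{22}}\right)\right\}. \tag{31}$$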
Fig. 4 - Comparison of the duration of fades of the electric field with that of the energy density. [Plot omitted. Abscissa: signal level in dB with respect to rms value, from -20 to 10 dB. Ordinate: the average duration of fades below a given signal level, normalized by $\sqrt{2\pi}/\beta V$, on a logarithmic scale. Curves: the average duration of fades below a given level of the energy density, and the average duration of fades below a given level of the electric field.]


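Hence, the expected number of level crossings $n(R_{hx})$ at a given signal amplitude ($r_{hx} = R_{hx}$) can be obtained from (5) by using $p(R_{hx}, \dot r_{hx})$ in (31):

$$n(R_{hx}) = \int_0^{\infty} \dot r_{hx}\, p(R_{hx}, \dot r_{hx})\, d\dot r_{hx} = \sqrt{\frac{\mu'_{22}}{2\pi}}\;\frac{R_{hx}}{\mu_{22}} \exp\left(-\frac{R_{hx}^2}{2\mu_{22}}\right). \tag{32}$$

The variance of $r_{hx}$ is

$$\langle r_{hx}^2\rangle = \langle X_2^2\rangle + \langle Y_2^2\rangle = 2\mu_{22} = N.$$

Let the level normalized to the rms value be

$$\tilde R_{hx} = \frac{R_{hx}}{\sqrt{\langle r_{hx}^2\rangle}}. \tag{33}$$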
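Substituting the values of the variances $\mu_{22}$ and $\mu'_{22}$ into (32) and replacing N by $\langle r_{hx}^2\rangle$, we get, in terms of the level $\tilde R_{hx}$ normalized to the rms value,

$$n(\tilde R_{hx}) = \frac{\beta V}{\sqrt{2\pi}}\,\sqrt{1 - \tfrac{1}{2}\cos 2\alpha}\;\tilde R_{hx} \exp(-\tilde R_{hx}^2). \tag{34}$$

Equation (34) has the same form as (26) except for a multiplying factor which is a function of $\alpha$, shown in Fig. 5. Hence, $n(\tilde R_{hx})$ is also a function of the angle $\alpha$. Thus, when the mobile is moving along the x-axis, $\alpha = 0$ or $\pi$, and

$$n(\tilde R_{hx}) = \frac{\beta V}{\sqrt{2\pi}}\,\frac{1}{\sqrt{2}}\,\tilde R_{hx} \exp(-\tilde R_{hx}^2), \tag{34a}$$

which is the minimum value of $n(\tilde R_{hx})$. When the mobile is moving along the $\pm$y-axis, $\alpha = \pm\pi/2$, and

$$n(\tilde R_{hx}) = \frac{\beta V}{\sqrt{2\pi}}\,\sqrt{\frac{3}{2}}\;\tilde R_{hx} \exp(-\tilde R_{hx}^2), \tag{34b}$$

which is the maximum value of $n(\tilde R_{hx})$. The ratio of level crossings for these two cases is

$$\frac{n(\tilde R_{hx})\,(\alpha = 0^\circ, 180^\circ)}{n(\tilde R_{hx})\,(\alpha = \pm 90^\circ)} = \frac{1}{\sqrt{3}}. \tag{34c}$$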
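The angle dependence in (34) is easy to tabulate. The sketch below, with a hypothetical helper name, evaluates the multiplying factor relative to (26) and checks the special cases (34a) to (34c).

```python
import math

def hx_rate_factor(alpha_deg):
    """Factor sqrt(1 - 0.5*cos(2*alpha)) that multiplies (26) to give (34)."""
    return math.sqrt(1.0 - 0.5 * math.cos(2.0 * math.radians(alpha_deg)))

for alpha_deg in (0, 15, 30, 45, 60, 75, 90):
    print(alpha_deg, round(hx_rate_factor(alpha_deg), 4))

# Ratio of the minimum (alpha = 0) to the maximum (alpha = 90 degrees), as in (34c).
ratio = hx_rate_factor(0.0) / hx_rate_factor(90.0)
```

At 45 degrees the factor is one, so the crossing rate of the magnetic field component then has the same form as that of the electric field.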





Fig. 5 - The effect of the angle $\alpha$ on the ratio of level crossing rates of the electric field to the x-component of the magnetic field. [Plot omitted. Abscissa: $\alpha$ in degrees, from 0 to 90 (equivalently from 180 to 90). Inset: top view of the vehicle showing the energy density antenna, with the loop which receives $H_x$ and the loop which receives $H_y$.]
For $\alpha = \pm 45^\circ$ and $\pm 135^\circ$,

$$n(\tilde R_{hx}) = \frac{\beta V}{\sqrt{2\pi}}\,\tilde R_{hx} \exp(-\tilde R_{hx}^2), \tag{34d}$$

which is the same expression as (26) for $\tilde R_e$. However, although (34d) and (26) have the same form, the magnitudes $R_e$ and $R_{hx}$ are different, since $\langle r_e^2\rangle$ in (25) is equal to two times $\langle r_{hx}^2\rangle$ in (33).

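The average duration of fades $t(\tilde R_{hx})$ of $H_x$ can be obtained without difficulty. It is easy to prove that the expression for the average length of the intervals during which $r_{hx} < R_{hx}$ has the same form as $t(\tilde R_e)$ in (30), except for a multiplying factor that depends on $\alpha$, as follows:

$$t(\tilde R_{hx}) = \frac{\sqrt{2\pi}}{\beta V}\,\frac{1}{\sqrt{1 - \tfrac{1}{2}\cos 2\alpha}}\;\frac{1}{\tilde R_{hx}}\left[\exp(\tilde R_{hx}^2) - 1\right]. \tag{35}$$

When $\alpha = 0^\circ$ and $180^\circ$:

$$t(\tilde R_{hx}) = \frac{\sqrt{2\pi}}{\beta V}\,\sqrt{2}\;\frac{1}{\tilde R_{hx}}\left[\exp(\tilde R_{hx}^2) - 1\right], \tag{35a}$$

which is the maximum value of $t(\tilde R_{hx})$, and when $\alpha = \pm 90^\circ$:

$$t(\tilde R_{hx}) = \frac{\sqrt{2\pi}}{\beta V}\,\sqrt{\frac{2}{3}}\;\frac{1}{\tilde R_{hx}}\left[\exp(\tilde R_{hx}^2) - 1\right], \tag{35b}$$

which is the minimum value of $t(\tilde R_{hx})$. The ratio of the average fade durations for these two cases is

$$\frac{t(\tilde R_{hx})\,(\alpha = 0^\circ, 180^\circ)}{t(\tilde R_{hx})\,(\alpha = \pm 90^\circ)} = \sqrt{3}, \tag{35c}$$

which is the inverse of (34c). This tells us that when $n(\tilde R_{hx})$ reaches a maximum value, the average duration of fades reaches a minimum value, and vice versa.

For $\alpha = \pm 45^\circ$, $\pm 135^\circ$,

$$t(\tilde R_{hx}) = \frac{\sqrt{2\pi}}{\beta V}\,\frac{1}{\tilde R_{hx}}\left[\exp(\tilde R_{hx}^2) - 1\right],$$

which is the same form as the expression for $t(\tilde R_e)$ in (30). We may say that at these angles, $\alpha = \pm 45^\circ$ and $\pm 135^\circ$, the E field and the $H_x$ field have the same average duration of fades below the level $\tilde R_e = \tilde R_{hx}$.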
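3.3 Finding the Values of $n(R_{hy})$ and $t(R_{hy})$ from the $H_y$ Field


Similarly, we obtain $n(R_{hy})$, the expected number of level crossings of a given signal level $R_{hy}$ from the $H_y$ field. It is very easy to see that $n(R_{hy})$ and $n(R_{hx})$ are the same forms of distribution as expressed in (34), except for the multiplying factor that depends on $\alpha$. The average duration of fades $t(R_{hy})$ during which $r_{hy}(t) < R_{hy}$ is also of the same form as $t(R_{hx})$ in (35), except for the multiplying factor depending on $\alpha$. The variances $\mu_{33}$ and $\mu'_{33}$ are given in the Appendix. We may thus write directly

$$n(\tilde R_{hy}) = \frac{\beta V}{\sqrt{2\pi}}\,\sqrt{1 + \tfrac{1}{2}\cos 2\alpha}\;\tilde R_{hy} \exp(-\tilde R_{hy}^2) \tag{36}$$

$$t(\tilde R_{hy}) = \frac{\sqrt{2\pi}}{\beta V}\,\frac{1}{\sqrt{1 + \tfrac{1}{2}\cos 2\alpha}}\;\frac{1}{\tilde R_{hy}}\left[\exp(\tilde R_{hy}^2) - 1\right], \tag{37}$$

where

$$\tilde R_{hy} = \frac{R_{hy}}{\sqrt{\langle r_{hy}^2\rangle}}.$$

It is obvious that

$$n(\tilde R_{hx})\big|_{\alpha = 0^\circ, 180^\circ} = n(\tilde R_{hy})\big|_{\alpha = \pm 90^\circ}, \qquad n(\tilde R_{hx})\big|_{\alpha = \pm 90^\circ} = n(\tilde R_{hy})\big|_{\alpha = 0^\circ, 180^\circ}$$

when $\tilde R_{hx} = \tilde R_{hy}$.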


IV. THE EXPECTED NUMBER OF LEVEL CROSSINGS AND THE AVERAGE 
DURATION OF FADES OF THE SIGNAL FROM AN ENERGY DENSITY 
ANTENNA 


J. R. Pierce has suggested utilization of the energy density concept as a possible means for reducing the signal fading in mobile radio. If we pick up the electric field e and the magnetic field h in free space, amplify the two fields by their appropriate relative gains, and square and add them, we obtain a signal proportional to the electromagnetic energy density

$$W = \tfrac{1}{2}(\epsilon e^2 + \mu h^2), \tag{38}$$

where $\epsilon$ is the dielectric constant and $\mu$ is the permeability. This idea can be realized by using a special antenna which receives three field components $e_z$, $h_x$, and $h_y$ simultaneously. The three signals enter separate square-law detectors, and the three detector outputs are added to obtain the energy density,

$$W = \tfrac{1}{2}\left(\epsilon |e_z|^2 + \mu |h_x|^2 + \mu |h_y|^2\right).$$


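We may express W in a different form

$$W = \frac{\epsilon}{2}\left(|e_z|^2 + \frac{\mu}{\epsilon}|h_x|^2 + \frac{\mu}{\epsilon}|h_y|^2\right) = \frac{\epsilon}{2}\left[|E_z|^2 + |H_x|^2 + |H_y|^2\right] = \frac{\epsilon}{2}\,\psi_t \ \text{joules/m}^3, \tag{39}$$

where each squared field term in the brackets is in volt$^2$/m$^2$.

We define $\psi_t$ as a normalized energy density

$$\psi_t = |E_z|^2 + |H_x|^2 + |H_y|^2 \ \text{volt}^2/\text{m}^2$$

$$= \psi_e + \psi_{hx} + \psi_{hy} \tag{40}$$

$$= (X_1^2 + Y_1^2) + (X_2^2 + Y_2^2) + (X_3^2 + Y_3^2). \tag{41}$$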


Gilbert has done some work on finding power spectra in energy reception for mobile radio. His work provides very useful background for this paper.

In this section, we derive the number of crossings $n(\Psi_t)$ at a given level of signal magnitude $\Psi_t$ using (5) in Section II. First of all, we need to find the joint probability density function $p(\psi_t, \dot\psi_t)$ of the signal $\psi_t$ and its slope $\dot\psi_t$. Since $\psi_t$ is a function of $(X_1, Y_1, X_2, Y_2, X_3, Y_3)$, and $\dot\psi_t$ is assumed to be a function of $(\dot\psi_e, \dot\psi_{hx}, \dot\psi_{hy})$, we will find that the variables $(X_1, Y_1, X_2, Y_2, X_3, Y_3)$ and $(\dot\psi_e, \dot\psi_{hx}, \dot\psi_{hy})$ are two independent Gaussian variable groups. Then,

$$p[\psi_t(X_1, Y_1, X_2, Y_2, X_3, Y_3),\ \dot\psi_t(\dot\psi_e, \dot\psi_{hx}, \dot\psi_{hy})]$$
$$= p[\psi_t(X_1, Y_1, X_2, Y_2, X_3, Y_3)] \times p[\dot\psi_t(\dot\psi_e, \dot\psi_{hx}, \dot\psi_{hy})]$$
$$= p(\psi_t)\,p(\dot\psi_t). \tag{42}$$

A brief sketch of the method of finding $p(\psi_t)$ and $p(\dot\psi_t)$ is given below.

4.1 To get $p(\psi_t)$

Since we know from (41) that

$$\psi_t = (X_1^2 + Y_1^2) + (X_2^2 + Y_2^2) + (X_3^2 + Y_3^2)$$

and since $X_1$, $Y_1$, ... are independent Gaussian variables, it is easy to get $p(\psi_t)$ from the Fourier transform of the characteristic function $M_{\psi_t}(v)$, where $M_{\psi_t}(v) = M_{X_1^2} M_{Y_1^2} M_{X_2^2} M_{Y_2^2} \cdots$, and we can get these M's very easily.


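4.2 To get $p(\dot\psi_t)$


All the terms in the summations of equations (79), (80), and (81) in the Appendix, which represent $\dot\psi_e$, $\dot\psi_{hx}$, and $\dot\psi_{hy}$, respectively, are statistically independent. Then by the central limit theorem these three variables $\dot\psi_e$, $\dot\psi_{hx}$, and $\dot\psi_{hy}$ are Gaussian distributed. Hence, the joint probability density function $p(\dot\psi_e, \dot\psi_{hx}, \dot\psi_{hy})$ can be established. Since $\dot\psi_t = \dot\psi_e + \dot\psi_{hx} + \dot\psi_{hy}$, we can get $p(\dot\psi_t)$ from the Fourier transform of the characteristic function $M_{\dot\psi_t}$, but $M_{\dot\psi_t}$ must now be obtained from the general definition

$$M_{\dot\psi_t}(v) = E[e^{jv\dot\psi_t}] = \iiint e^{jv(\dot\psi_e + \dot\psi_{hx} + \dot\psi_{hy})}\, p(\dot\psi_e, \dot\psi_{hx}, \dot\psi_{hy})\, d\dot\psi_e\, d\dot\psi_{hx}\, d\dot\psi_{hy} \tag{43}$$

since we have no simple way of getting $M_{\dot\psi_e}$, $M_{\dot\psi_{hx}}$, and $M_{\dot\psi_{hy}}$ separately.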

Let us introduce a new variable $\varepsilon$ which can be any one of the above Gaussian random variables. It has a zero mean and variance $\mu$. Then the probability density function of the square $y = \varepsilon^2$ is

$$p(y = \varepsilon^2) = \frac{1}{\sqrt{2\pi\mu y}} \exp\left(-\frac{y}{2\mu}\right) \tag{44}$$

for $y > 0$. The characteristic function corresponding to this probability density is

$$M_y(jv) = \int_0^{\infty} p(y)\, e^{jvy}\, dy = (1 - 2j\mu v)^{-1/2}. \tag{45}$$

From the Appendix we know all six variables $X_1$, $Y_1$, $X_2$, $Y_2$, $X_3$, and $Y_3$ are independent Gaussian variables. It is not hard to see that $X_1^2$, $Y_1^2$, $X_2^2$, $Y_2^2$, $X_3^2$, $Y_3^2$ are independent variables by obtaining $p(X_1^2, Y_1^2, X_2^2, Y_2^2, X_3^2, Y_3^2)$ from the Jacobian of the transformation of $p(X_1, Y_1, X_2, Y_2, X_3, Y_3)$. Then $X_1^2$ and $Y_1^2$ have the same characteristic function $(1 - 2j\mu_{11}v)^{-1/2}$. Also $X_2^2$, $Y_2^2$, $X_3^2$, and $Y_3^2$ have the same characteristic function $(1 - 2j\mu_{22}v)^{-1/2}$. Thus, by the addition theorem, the sum $\psi_t$ which is defined in (41) has the characteristic function

$$M_{\psi_t}(jv) = \frac{1}{(1 - 2j\mu_{11}v)(1 - 2j\mu_{22}v)^2}. \tag{46}$$
Then the probability density function $p(\psi_t)$ can be obtained from the Fourier transformation of the characteristic function

$$p(\psi_t) = \frac{1}{2\pi}\int_{-\infty}^{\infty} M_{\psi_t}(jv)\, e^{-jv\psi_t}\, dv = \frac{\mu_{11}}{2(\mu_{22} - \mu_{11})^2}\left[\exp\left(-\frac{\psi_t}{2\mu_{11}}\right) - \exp\left(-\frac{\psi_t}{2\mu_{22}}\right)\right] + \frac{\psi_t \exp(-\psi_t/2\mu_{22})}{4\mu_{22}(\mu_{22} - \mu_{11})}. \tag{47}$$
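As a sanity check on the closed form (47), the following sketch integrates the density numerically for the variances $\mu_{11} = N$ and $\mu_{22} = N/2$ and compares the zeroth and second moments with 1 and $22N^2$. The function names, the choice N = 10, and the simple Riemann sum are assumptions made for illustration only.

```python
import math

def energy_density_pdf(psi, n_waves):
    """Density (47) with mu11 = N and mu22 = N/2."""
    mu11 = float(n_waves)
    mu22 = n_waves / 2.0
    d = mu22 - mu11
    return (mu11 / (2.0 * d * d)) * (math.exp(-psi / (2.0 * mu11))
                                     - math.exp(-psi / (2.0 * mu22))) \
        + psi * math.exp(-psi / (2.0 * mu22)) / (4.0 * mu22 * d)

def moment(order, n_waves, steps=100000):
    """Riemann-sum estimate of E[psi**order]; the integrand vanishes at both ends."""
    upper = 80.0 * n_waves            # far enough into the exponential tail
    h = upper / steps
    return h * sum((k * h) ** order * energy_density_pdf(k * h, n_waves)
                   for k in range(1, steps))

N = 10
total_probability = moment(0, N)
mean_square = moment(2, N)
print(round(total_probability, 5), round(mean_square / N**2, 3))
```

The density integrates to one and the mean square comes out at about $22N^2$, consistent with the rms value used later to normalize the energy density level.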

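The joint probability density $p(\dot\psi_e, \dot\psi_{hx}, \dot\psi_{hy})$ has been derived in the Appendix:

$$p(\dot\psi_e, \dot\psi_{hx}, \dot\psi_{hy}) = \frac{1}{(2\pi)^{3/2}|\Lambda|^{1/2}} \exp\left\{-\frac{1}{2|\Lambda|}\left[|\Lambda_{11}|\dot\psi_e^2 + |\Lambda_{22}|\dot\psi_{hx}^2 + |\Lambda_{33}|\dot\psi_{hy}^2 + 2|\Lambda_{12}|\dot\psi_e\dot\psi_{hx} + 2|\Lambda_{13}|\dot\psi_e\dot\psi_{hy} + 2|\Lambda_{23}|\dot\psi_{hx}\dot\psi_{hy}\right]\right\}, \tag{48}$$

where $[\Lambda]$ is the covariance matrix of $\dot\psi_e$, $\dot\psi_{hx}$, $\dot\psi_{hy}$ and the $|\Lambda_{mn}|$ are the cofactors of $|\Lambda|$, given in the Appendix. From (40), $\dot\psi_t$ is the sum of the three random variables $\dot\psi_e$, $\dot\psi_{hx}$, and $\dot\psi_{hy}$. Then the characteristic function for $\dot\psi_t$ is

$$M_{\dot\psi_t}(jv) = E[\exp\{jv(\dot\psi_e + \dot\psi_{hx} + \dot\psi_{hy})\}] = \iiint_{-\infty}^{\infty} \exp\{jv(\dot\psi_e + \dot\psi_{hx} + \dot\psi_{hy})\}\, p(\dot\psi_e, \dot\psi_{hx}, \dot\psi_{hy})\, d\dot\psi_e\, d\dot\psi_{hx}\, d\dot\psi_{hy}. \tag{49}$$

The details of this computation are given in the Appendix with the result (92):

$$M_{\dot\psi_t}(jv) = \exp\{-\tfrac{1}{2}\rho' v^2\}, \tag{50}$$

where

$$\rho' = \rho'_{11} + \rho'_{22} + \rho'_{33} + 2\rho'_{12} + 2\rho'_{13}. \tag{51}$$

The probability density of the total $\dot\psi_t$ is then

$$p(\dot\psi_t) = \frac{1}{2\pi}\int_{-\infty}^{\infty} M_{\dot\psi_t}(jv)\, e^{-jv\dot\psi_t}\, dv = \frac{1}{\sqrt{2\pi\rho'}} \exp\left(-\frac{\dot\psi_t^2}{2\rho'}\right). \tag{52}$$

The joint probability density $p(\psi_t, \dot\psi_t)$ of $\psi_t$ and $\dot\psi_t$ can now be obtained by substituting (47) and (52) into (42).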
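The expected number of crossings $n(\Psi_t)$ at a given signal level $\psi_t = \Psi_t$ made by the total energy density signal in one second can be obtained from (5):

$$n(\Psi_t) = \int_0^{\infty} \dot\psi_t\, p(\Psi_t, \dot\psi_t)\, d\dot\psi_t = p(\Psi_t)\int_0^{\infty} \dot\psi_t\, p(\dot\psi_t)\, d\dot\psi_t$$

$$= \sqrt{\frac{\rho'}{2\pi}}\left\{\frac{\mu_{11}}{2(\mu_{22} - \mu_{11})^2}\left[\exp\left(-\frac{\Psi_t}{2\mu_{11}}\right) - \exp\left(-\frac{\Psi_t}{2\mu_{22}}\right)\right] + \frac{\Psi_t \exp(-\Psi_t/2\mu_{22})}{4\mu_{22}(\mu_{22} - \mu_{11})}\right\}, \tag{53}$$

where $\rho'$ is given by (51) and $\mu_{11} = N$, $\mu_{22} = N/2$ as shown in the Appendix.

Also we know from Gilbert that

$$\langle \psi_t^2\rangle = 22N^2.$$

In addition, $\rho' = \rho'_{11} + \rho'_{22} + \rho'_{33} + 2\rho'_{12} + 2\rho'_{13} = 2N^2(\beta V)^2$, and letting the level normalized to the rms value be

$$\tilde\Psi_t = \frac{\Psi_t}{\sqrt{\langle \psi_t^2\rangle}} = \frac{\Psi_t}{\psi_{t\,(\mathrm{rms})}},$$

we can simplify (53) as follows: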


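$$n(\tilde\Psi_t) = \frac{\beta V}{\sqrt{2\pi}}\left\{2\sqrt{2}\left[\exp\left(-\frac{\sqrt{22}}{2}\tilde\Psi_t\right) - \exp\left(-\sqrt{22}\,\tilde\Psi_t\right)\right] - 2\sqrt{11}\,\tilde\Psi_t \exp\left(-\sqrt{22}\,\tilde\Psi_t\right)\right\}. \tag{54}$$

Equation (54) is a distribution which is independent of the angle $\alpha$. When the normalized level $\tilde\Psi_t = 1$, the level $\Psi_t$ is equal to its rms value. Equation (54) then becomes

$$n(\tilde\Psi_t)\big|_{\tilde\Psi_t = 1} = \frac{\beta V}{\sqrt{2\pi}} \times 0.1839.$$

Also let $\tilde R_e = 1$ in (26):

$$n(\tilde R_e)\big|_{\tilde R_e = 1} = \frac{\beta V}{\sqrt{2\pi}} \times 0.3678.$$

It is shown that the expected number of crossings of the total energy density is one half the expected number of crossings of the envelope of the E field at their rms levels ($\tilde\Psi_t = \tilde R_e^2$). In other words, the energy density fades less frequently than the E field by at least a factor of 2:

$$n(\tilde\Psi_t) \le \tfrac{1}{2}\, n(\tilde R_e)$$

for $\tilde\Psi_t = \tilde R_e^2$ with respect to their rms values. The theoretical values of $n(\tilde\Psi_t)$ and $n(\tilde R_e)$ are in Fig. 3.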
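The two rms-level values quoted above can be reproduced directly from (26) and (54). The sketch below uses hypothetical function names and works with the common factor $\beta V/\sqrt{2\pi}$ divided out.

```python
import math

S22 = math.sqrt(22.0)

def crossing_rate_energy(psi):
    """(sqrt(2*pi)/(beta*V)) * n from (54); psi is the level relative to rms."""
    e1 = math.exp(-0.5 * S22 * psi)
    e2 = math.exp(-S22 * psi)
    return 2.0 * math.sqrt(2.0) * (e1 - e2) - 2.0 * math.sqrt(11.0) * psi * e2

def crossing_rate_e(r):
    """(sqrt(2*pi)/(beta*V)) * n from (26); r is the level relative to rms."""
    return r * math.exp(-r * r)

energy_at_rms = crossing_rate_energy(1.0)
field_at_rms = crossing_rate_e(1.0)
print(round(energy_at_rms, 4), round(field_at_rms, 4),
      round(energy_at_rms / field_at_rms, 3))
```

Evaluated this way, the energy density value comes out at about 0.184 and the electric field value at about 0.368, so the ratio at the rms levels is very close to one half.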
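The average duration of fades below a given level $\Psi_t$ is given by (7):

$$t(\Psi_t) = \frac{P(\psi_t(t) < \Psi_t)}{n(\Psi_t)}, \tag{55}$$

where $P(\psi_t(t) < \Psi_t)$ is the probability function obtained from $p(\psi_t(t))$:

$$P(\psi_t(t) < \Psi_t) = \int_0^{\Psi_t} p(\psi_t(t))\, d\psi_t = 1 - 4\exp\left(-\frac{\sqrt{22}}{2}\tilde\Psi_t\right) + \left(3 + \sqrt{22}\,\tilde\Psi_t\right)\exp\left(-\sqrt{22}\,\tilde\Psi_t\right), \tag{56}$$

where

$$\tilde\Psi_t = \frac{\Psi_t}{\psi_{t\,(\mathrm{rms})}}.$$

Substituting (54) and (56) into (55), we obtain the average duration of a fade below a given level $\tilde\Psi_t$:

$$t(\tilde\Psi_t) = \frac{\sqrt{2\pi}}{\beta V}\,\frac{1}{\sqrt{2}}\left\{\frac{1 - 2\exp\left(-\frac{\sqrt{22}}{2}\tilde\Psi_t\right) + \exp\left(-\sqrt{22}\,\tilde\Psi_t\right)}{2\left[\exp\left(-\frac{\sqrt{22}}{2}\tilde\Psi_t\right) - \exp\left(-\sqrt{22}\,\tilde\Psi_t\right)\right] - \sqrt{22}\,\tilde\Psi_t\exp\left(-\sqrt{22}\,\tilde\Psi_t\right)} - 1\right\}. \tag{57}$$

When $\tilde\Psi_t = 1$, (57) becomes

$$t(\tilde\Psi_t) = \frac{\sqrt{2\pi}}{\beta V} \times 3.74.$$

When $\tilde R_e = 1$, (30) becomes

$$t(\tilde R_e) = \frac{\sqrt{2\pi}}{\beta V} \times (e - 1) \approx \frac{\sqrt{2\pi}}{\beta V} \times 1.72.$$

It is shown that the average duration of fades of the energy density below a level $\tilde\Psi_t$ is larger than the average duration of fades of the E field below a level $\tilde R_e$, where $\tilde\Psi_t = \tilde R_e^2 = 1$. The curves of $t(\tilde\Psi_t)$ and $t(\tilde R_e)$ have been plotted in Fig. 4 for comparison.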
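The rms-level fade durations can be checked from (30), (54), and (56) in the same way. This is a sketch with hypothetical function names; the common factor $\sqrt{2\pi}/\beta V$ is divided out of both durations.

```python
import math

S22 = math.sqrt(22.0)

def fade_duration_energy(psi):
    """(beta*V/sqrt(2*pi)) * t for the energy density: (56) divided by (54)."""
    e1 = math.exp(-0.5 * S22 * psi)
    e2 = math.exp(-S22 * psi)
    prob_below = 1.0 - 4.0 * e1 + (3.0 + S22 * psi) * e2           # (56)
    rate = math.sqrt(2.0) * (2.0 * (e1 - e2) - S22 * psi * e2)     # (54)
    return prob_below / rate

def fade_duration_e(r):
    """(beta*V/sqrt(2*pi)) * t for the electric field envelope, from (30)."""
    return math.expm1(r * r) / r

energy_at_rms = fade_duration_energy(1.0)
field_at_rms = fade_duration_e(1.0)
print(round(energy_at_rms, 2), round(field_at_rms, 2))
```

At the rms levels the energy density value is a little over twice the electric field value: fades of the energy density are less frequent but, at this level, longer on average.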
V. DISCUSSION OF THE THEORETICAL RESULTS 


From the above derivation we know that $\psi_t$ and $\dot\psi_t$ are two independent variables as shown by (42):

$$p(\psi_t, \dot\psi_t) = p(\psi_t)\,p(\dot\psi_t),$$

and therefore (5) can be written as follows:

$$n(\Psi_t) = p(\Psi_t)\int_0^{\infty} \dot\psi_t\, p(\dot\psi_t)\, d\dot\psi_t = p(\Psi_t)\,\{\dot\psi_t\}, \tag{58}$$

where $\{\dot\psi_t\}$ represents the integral. Equation (58) simply shows that the expected number of crossings $n(\Psi_t)$ at a given level $\Psi_t$ can be obtained from the probability density at the level $\Psi_t$ times the integral $\{\dot\psi_t\}$. The average duration of fades, then, turns out to be

$$t(\Psi_t) = \frac{P(\psi_t < \Psi_t)}{n(\Psi_t)} = \frac{1}{\{\dot\psi_t\}}\,\frac{P(\psi_t < \Psi_t)}{p(\Psi_t)}. \tag{59}$$

We emphasize that (58) and (59) are valid only when $\psi_t$ and $\dot\psi_t$ are two independent variables.

The two curves, $n(\tilde R_e)$ and $n(\tilde\Psi_t)$, are plotted in Fig. 3, normalized by the common factor $\sqrt{2\pi}/\beta V$. Both curves are plotted as functions of the signal level normalized to their own rms values. The value of $n(\tilde R_e)$ is, as shown, always higher than the value of $n(\tilde\Psi_t)$ for any signal level. From these two curves, it may be said that the fading of the energy density is less frequent than the fading of the envelope of the electric field. The maximum expected numbers of crossings of both $n(\tilde R_e)$ and $n(\tilde\Psi_t)$ are at the -3 dB level, which means that for signal levels at $1/\sqrt{2}$ and $\tfrac{1}{2}$ of their rms values, respectively, we will count the most fades. The curve of $n(\tilde\Psi_t)$ drops faster on both sides of 0 dB than the curve of $n(\tilde R_e)$, which means that the range of the signal amplitude $\psi_t$ is less than the range of the signal amplitude $r_e$.

The average duration of the signal below a given amplitude level is another way of looking at the fading problem. Fig. 4 shows that the average duration of fades of the energy density $t(\tilde\Psi_t)$ is always larger than the average duration of fades of the electric field $t(\tilde R_e)$ when the given signal levels are above -3 dB with respect to their rms values ($\tilde\Psi_t = \tilde R_e^2 > -3$ dB). The value of $t(\tilde\Psi_t)$ is less than $t(\tilde R_e)$ when the given signal levels are more than 3 dB below the rms values ($\tilde\Psi_t = \tilde R_e^2 < -3$ dB). When the given signal level is at -3 dB ($\tilde\Psi_t = \tilde R_e^2 = -3$ dB), the average durations of fades $t(-3\ \text{dB})$ of the energy density and of the electric field are the same.


VI. COMPARISON OF THE THEORETICAL PREDICTION WITH THE EXPERI- 
MENTS 


The three field components $E_z$, $H_x$, and $H_y$ have been received by a special antenna mounted on a mobile van moving at a speed of 15 mile/hr. All the figures shown in this section were taken on Commonwealth Avenue, New Providence, New Jersey, from a transmitting antenna at 836 MHz at Bell Laboratories, Murray Hill. After adjusting the appropriate relative gains of the three fields, the energy density can be obtained by squaring and summing these three fields by computer:

$$\psi_t = |E_z|^2 + |H_x|^2 + |H_y|^2 \ \text{volt}^2/\text{m}^2.$$

Fig. 6 - Comparison of the predicted level crossing rates to the observed rates for the electric field on Commonwealth Avenue, New Providence, New Jersey.

Fig. 7 - Comparison of the predicted average duration of fades to the observed average duration of fades for the electric field on Commonwealth Avenue, New Providence, New Jersey.

Fig. 8 - Comparison of the predicted level crossing rates to the observed rates for the energy density on Commonwealth Avenue, New Providence, New Jersey.

Since the distance between the transmitting antenna and the mobile unit is relatively short, the angle swept out by the radius vector from the base station to the mobile unit varies considerably over a typical length of run. To reduce the variation of this angle, the data for the entire run were cut into sections 8 seconds long, corresponding to 175 feet of travel, for computer processing. Each section, either the envelope $r_e$ of the E field or the energy density $\psi_t$, was used to obtain the number of level crossings n and the average duration of fades t by computer program. However, since the experimental curves of n and t were almost all alike for all sections, we used only one for comparison with the theoretical curve.
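The paper does not describe the computer program used on each 8-second section. A minimal sketch of one way to estimate n and t from a uniformly sampled record is given below; the function name, the sampling interval argument, and the convention of counting only upward crossings are all assumptions.

```python
def crossing_stats(samples, level, dt):
    """Estimate (upward crossings per second, average fade duration in seconds).

    samples: envelope or energy density values taken every dt seconds.
    A fade is any stretch of samples below `level`; following (7), the average
    duration is the fraction of time below the level divided by the crossing rate.
    """
    upward = sum(1 for prev, cur in zip(samples, samples[1:])
                 if prev < level <= cur)
    below = sum(1 for value in samples if value < level)
    record_length = len(samples) * dt
    rate = upward / record_length
    average_fade = (below * dt) / upward if upward else float("nan")
    return rate, average_fade

# Toy record: two fades of two samples each in a ten-sample record.
rate, average_fade = crossing_stats([2, 2, 0, 0, 2, 2, 0, 0, 2, 2], level=1, dt=1.0)
print(rate, average_fade)
```

Repeating the call over a set of levels expressed relative to the rms value of the section would give experimental curves of the kind compared with theory in Figs. 6 to 9.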

Fig. 6 shows a comparison of the curves of the expected number of crossings $n(\tilde R_e)$ at any level $\tilde R_e$ for both experiment and theory. The shape of the experimental curve is in fairly good agreement with the theoretical curve. Since the receiving antenna on Commonwealth Avenue is in line of sight with the transmitting antenna at Bell Laboratories, a small direct wave component may be introduced. This small direct wave component is not considered in our theoretical analysis; hence the values of $n(\tilde R_e)$ from the experiments should be less than the theoretical results, as we would predict.

Fig. 7 shows a comparison of the curves of the average duration of fades $t(\tilde R_e)$ for both experiment and theory. They are quite alike. Since a small direct wave component does exist, the average duration of fades for the experimental data should be higher than the theoretical results.

Fig. 8 shows a comparison of the curves of the expected number of crossings $n(\tilde\Psi_t)$ at any level $\tilde\Psi_t$ for both experiment and theory. The shape of the experimental curve is very much like the theoretical curve. It shows that the theoretical model used in this paper is quite acceptable.

Fig. 9 shows a comparison of the curves of the average duration of fades $t(\tilde\Psi_t)$ for both experiment and theory. The difference between the experimental curve and the theoretical curve may be caused by the small direct wave component. A small direct wave component, if introduced into our theoretical model, may give a slightly higher average duration of fades than the model now predicts, but does not affect the number of level crossings.

Fig. 9 - Comparison of the predicted average duration of fades to the observed average duration of fades for the energy density on Commonwealth Avenue, New Providence, New Jersey.
VII. CONCLUSIONS


Comparing the expected number of level crossings and the average durations of fades of the energy density with those of the E field, we see that the fading of the energy density is much less severe than the fading of the envelope of the E field.

Referring to Fig. 5, which shows the fading rate related to the orienta- 
tion of the energy density antenna and the direction of vehicle motion, 
we see that when the two orthogonal loops are at 45° to the direction of 
motion, the fades of all three field components are the same. When one 
loop is lined up with the direction of motion and the other normal to 
it, the H field component received from the loop normal to the motion 
has less fading than either of the other two field components. 

The expected number of crossings per second of fades at a given signal level, n, for both $\tilde R_e$ and $\tilde\Psi_t$, is proportional to the carrier frequency $f_c$ and the mobile speed V, as shown in (26) and (54). They have the common factor $\beta V/\sqrt{2\pi} = \sqrt{2\pi}\,(V f_c/c)$, where c is the velocity of light. Hence, if either V or $f_c$ goes higher, n becomes greater.

The average duration of fades, t, is inversely proportional to the carrier frequency $f_c$ and the mobile speed V, as shown in (30) and (57). Hence, if either V or $f_c$ goes higher, t becomes smaller.
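To give a feel for the scale of the common factor, the sketch below evaluates it for a van speed of 15 mile/hr and a carrier of 836 MHz. These are taken here as assumed test conditions (the frequency is read from a damaged figure in Section VI), and the function name is hypothetical.

```python
import math

C_LIGHT = 299_792_458.0      # velocity of light, m/s
MPH_TO_MPS = 0.44704         # miles per hour to metres per second

def common_factor(speed_mps, carrier_hz):
    """beta*V/sqrt(2*pi) = sqrt(2*pi) * V * f_c / c, in units of 1/second."""
    return math.sqrt(2.0 * math.pi) * speed_mps * carrier_hz / C_LIGHT

factor = common_factor(15.0 * MPH_TO_MPS, 836e6)    # assumed test conditions
rate_at_rms = factor * math.exp(-1.0)               # (26) at the rms level
fade_at_rms = (math.e - 1.0) / factor               # (30) at the rms level
print(round(factor, 1), round(rate_at_rms, 1), round(fade_at_rms * 1000.0, 1))
```

Under these assumptions the electric field envelope would cross its rms level on the order of 17 times per second, with fades below that level lasting a few tens of milliseconds on average. Doubling either the speed or the carrier frequency doubles the rate and halves the duration.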

The foregoing theoretical analysis is based on a Gaussian model and 
does not include a direct wave component. Even so, this analysis is 
compared with the experimental results in Section VI with fairly good 
agreement. 
APPENDIX


A.1 Finding the mean values, variances and covariances of nine variables $(X_1, Y_1, X_2, Y_2, X_3, Y_3, \dot\psi_e, \dot\psi_{hx}, \dot\psi_{hy})$


From (8), (9), and (10) we may express the fields in the following forms:

$$E_z = X_1 + jY_1$$
$$H_x = X_2 + jY_2$$
$$H_y = X_3 + jY_3,$$

where

$$X_1 = \sum_{u=1}^{N} (R_u \cos\Phi_u + S_u \sin\Phi_u) \tag{60}$$

$$Y_1 = \sum_{u=1}^{N} (S_u \cos\Phi_u - R_u \sin\Phi_u) \tag{61}$$

$$X_2 = \sum_{u=1}^{N} (R_u \cos\Phi_u + S_u \sin\Phi_u)\sin\theta_u \tag{62}$$

$$Y_2 = \sum_{u=1}^{N} (S_u \cos\Phi_u - R_u \sin\Phi_u)\sin\theta_u \tag{63}$$

$$X_3 = -\sum_{u=1}^{N} (R_u \cos\Phi_u + S_u \sin\Phi_u)\cos\theta_u \tag{64}$$

$$Y_3 = -\sum_{u=1}^{N} (S_u \cos\Phi_u - R_u \sin\Phi_u)\cos\theta_u, \tag{65}$$

also

$$\Phi_u = \beta V t \cos(\theta_u - \alpha) \tag{66}$$

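and the angles $\theta_u$ and $\alpha$ are shown in Fig. 2. The time derivatives of $X_1$, $Y_1$, $X_2$, $Y_2$, $X_3$, and $Y_3$ are

$$\dot X_1 = \beta V \sum_{u=1}^{N} (-R_u \sin\Phi_u + S_u \cos\Phi_u)\cos(\theta_u - \alpha) \tag{67}$$

$$\dot Y_1 = \beta V \sum_{u=1}^{N} (-S_u \sin\Phi_u - R_u \cos\Phi_u)\cos(\theta_u - \alpha) \tag{68}$$

$$\dot X_2 = \beta V \sum_{u=1}^{N} (-R_u \sin\Phi_u + S_u \cos\Phi_u)\sin\theta_u \cos(\theta_u - \alpha) \tag{69}$$

$$\dot Y_2 = \beta V \sum_{u=1}^{N} (-S_u \sin\Phi_u - R_u \cos\Phi_u)\sin\theta_u \cos(\theta_u - \alpha) \tag{70}$$

$$\dot X_3 = -\beta V \sum_{u=1}^{N} (-R_u \sin\Phi_u + S_u \cos\Phi_u)\cos\theta_u \cos(\theta_u - \alpha) \tag{71}$$

$$\dot Y_3 = -\beta V \sum_{u=1}^{N} (-S_u \sin\Phi_u - R_u \cos\Phi_u)\cos\theta_u \cos(\theta_u - \alpha). \tag{72}$$

The mean values of all the above random variables are zero:

$$\langle X_1\rangle = \langle Y_1\rangle = \langle X_2\rangle = \langle Y_2\rangle = \langle X_3\rangle = \langle Y_3\rangle = \langle \dot X_1\rangle = \langle \dot Y_1\rangle = \langle \dot X_2\rangle = \langle \dot Y_2\rangle = \langle \dot X_3\rangle = \langle \dot Y_3\rangle = 0.$$

The variances of all the above random variables are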
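$$\mu_{11} = \langle X_1^2\rangle = \langle Y_1^2\rangle = N \tag{73}$$

$$\mu_{22} = \langle X_2^2\rangle = \langle Y_2^2\rangle = \frac{N}{2} \quad \text{for any } N \tag{74}$$

$$\mu_{33} = \langle X_3^2\rangle = \langle Y_3^2\rangle = \frac{N}{2} \tag{75}$$

$$\mu'_{11} = \langle \dot X_1^2\rangle = \langle \dot Y_1^2\rangle = \frac{N}{2}(\beta V)^2 \quad \text{for } N \ge 3 \tag{76}$$

$$\mu'_{22} = \langle \dot X_2^2\rangle = \langle \dot Y_2^2\rangle = \frac{N}{8}(\beta V)^2\left[\cos^2\alpha + 3\sin^2\alpha\right] \quad \text{for } N = 3,\ N \ge 5 \tag{77}$$

$$\mu'_{33} = \langle \dot X_3^2\rangle = \langle \dot Y_3^2\rangle = \frac{N}{8}(\beta V)^2\left[3\cos^2\alpha + \sin^2\alpha\right] \quad \text{for } N = 3,\ N \ge 5. \tag{78}$$

Remark: The value of $\mu'_{11}$ for $N \ge 3$ is derived as follows: The summations of sine and cosine functions can be expressed as

$$\sum_{k=1}^{N} \sin kx = \frac{\cos\dfrac{x}{2} - \cos\dfrac{(2N+1)x}{2}}{2\sin\dfrac{x}{2}}$$

$$\sum_{k=1}^{N} \cos kx = \frac{\sin\dfrac{(2N+1)x}{2} - \sin\dfrac{x}{2}}{2\sin\dfrac{x}{2}}.$$

Then in (67)

$$\theta_u = \frac{2\pi u}{N}$$

and

$$\sum_{u=1}^{N} \cos^2(\theta_u - \alpha) = \frac{1}{2}\sum_{u=1}^{N}\left[1 + \cos\left(u\frac{4\pi}{N} - 2\alpha\right)\right] = \frac{N}{2} + \frac{\cos 2\alpha}{2}\sum_{u=1}^{N}\cos u\frac{4\pi}{N} + \frac{\sin 2\alpha}{2}\sum_{u=1}^{N}\sin u\frac{4\pi}{N}.$$

But,

$$\sum_{u=1}^{N} \cos u\frac{4\pi}{N} = \frac{\sin\left(4\pi + \dfrac{2\pi}{N}\right) - \sin\dfrac{2\pi}{N}}{2\sin\dfrac{2\pi}{N}} = 0$$

and, for $N \ge 3$,

$$\sum_{u=1}^{N} \sin u\frac{4\pi}{N} = \frac{\cos\dfrac{2\pi}{N} - \cos\left(4\pi + \dfrac{2\pi}{N}\right)}{2\sin\dfrac{2\pi}{N}} = 0.$$

Thus, apart from the factor $(\beta V)^2$, the average value of $\dot X_1^2$ or $\dot Y_1^2$ is

$$\sum_{u=1}^{N} \cos^2\left(\frac{2\pi u}{N} - \alpha\right) = \frac{N}{2} \quad \text{for } N \ge 3.$$

Following the same derivation, we obtain the valid range of N for $\mu'_{22}$ and $\mu'_{33}$. Later on we will obtain the range of N for $\rho'_{11}$, $\rho'_{22}$, $\rho'_{33}$, $\rho'_{12}$, $\rho'_{13}$, and $\rho'_{23}$ in the same way.


QED.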
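The trigonometric sums behind these variances can be checked by direct evaluation. The sketch below (function names are hypothetical) confirms the value N/2 for the cosine-squared sum when N is at least 3, and suggests that the corresponding sum for the second magnetic-field variance, with its extra sine-squared weight, takes the stated form for N = 3 and N of 5 or more but not for N = 4.

```python
import math

def sum_cos_sq(n, alpha):
    """Sum over u of cos^2(2*pi*u/n - alpha)."""
    return sum(math.cos(2.0 * math.pi * u / n - alpha) ** 2 for u in range(1, n + 1))

def sum_sin_sq_cos_sq(n, alpha):
    """Sum over u of sin^2(theta_u) * cos^2(theta_u - alpha), theta_u = 2*pi*u/n."""
    return sum(math.sin(2.0 * math.pi * u / n) ** 2
               * math.cos(2.0 * math.pi * u / n - alpha) ** 2
               for u in range(1, n + 1))

def closed_form(n, alpha):
    """(n/8) * (cos^2(alpha) + 3*sin^2(alpha)), the bracketed form in the text."""
    return n / 8.0 * (math.cos(alpha) ** 2 + 3.0 * math.sin(alpha) ** 2)

for n in (2, 3, 4, 5, 8):
    print(n, round(sum_cos_sq(n, 0.3), 6), round(sum_sin_sq_cos_sq(n, 0.3), 6),
          round(closed_form(n, 0.3), 6))
```

For N = 4 the fourth-harmonic terms no longer cancel, which is why that case is excluded from the stated range.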


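Now we are going to find the relations between all the six variables $X_1$, $X_2$, $X_3$, $Y_1$, $Y_2$, $Y_3$ and their time derivatives. We know that if two variables a and b are Gaussian, and also uncorrelated, $\langle ab\rangle = 0$, then a and b are independent. The covariances of $X_1$, $Y_1$, $\dot X_1$, and $\dot Y_1$ are

$$\langle X_1 Y_1\rangle = \langle X_1 \dot X_1\rangle = \langle X_1 \dot Y_1\rangle = \langle Y_1 \dot X_1\rangle = \langle Y_1 \dot Y_1\rangle = \langle \dot X_1 \dot Y_1\rangle = 0;$$

hence, the four variables $X_1$, $Y_1$, $\dot X_1$, and $\dot Y_1$ are statistically independent. The covariances of $X_2$, $Y_2$, $\dot X_2$, and $\dot Y_2$ are

$$\langle X_2 Y_2\rangle = \langle X_2 \dot X_2\rangle = \langle X_2 \dot Y_2\rangle = \langle Y_2 \dot X_2\rangle = \langle Y_2 \dot Y_2\rangle = \langle \dot X_2 \dot Y_2\rangle = 0;$$

hence, the four variables $X_2$, $Y_2$, $\dot X_2$, and $\dot Y_2$ are statistically independent. The covariances of $X_3$, $Y_3$, $\dot X_3$, and $\dot Y_3$ are

$$\langle X_3 Y_3\rangle = \langle X_3 \dot X_3\rangle = \langle X_3 \dot Y_3\rangle = \langle Y_3 \dot X_3\rangle = \langle Y_3 \dot Y_3\rangle = \langle \dot X_3 \dot Y_3\rangle = 0;$$

hence, the four variables $X_3$, $Y_3$, $\dot X_3$, and $\dot Y_3$ are statistically independent.

Also we may show that

$$\langle X_m Y_n\rangle = 0 \quad \text{for all } m \text{ and } n,$$
$$\langle X_m X_n\rangle = \langle Y_m Y_n\rangle = 0 \quad \text{for } m \ne n,$$

where $m = 1, 2, 3$ and $n = 1, 2, 3$; hence the six variables $X_1$, $X_2$, $X_3$, $Y_1$, $Y_2$, $Y_3$ are independent. The rates of change of the energy densities of the three fields $E_z$, $H_x$, and $H_y$ are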

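$$\dot\psi_e = \frac{d}{dt}\psi_e = \frac{d}{dt}(X_1^2 + Y_1^2) = \beta V \sum_{u}\sum_{v}\left[-(R_u R_v + S_u S_v)\sin(\Phi_u - \Phi_v) + (S_u R_v - R_u S_v)\cos(\Phi_u - \Phi_v)\right] \times \left[\cos(\theta_u - \alpha) - \cos(\theta_v - \alpha)\right], \tag{79}$$

$$\dot\psi_{hx} = \frac{d}{dt}\psi_{hx} = \frac{d}{dt}(X_2^2 + Y_2^2) = \beta V \sum_{u}\sum_{v}\left[-(R_u R_v + S_u S_v)\sin(\Phi_u - \Phi_v) + (S_u R_v - R_u S_v)\cos(\Phi_u - \Phi_v)\right] \cdot \sin\theta_u \sin\theta_v \left[\cos(\theta_u - \alpha) - \cos(\theta_v - \alpha)\right], \tag{80}$$

$$\dot\psi_{hy} = \frac{d}{dt}\psi_{hy} = \frac{d}{dt}(X_3^2 + Y_3^2) = \beta V \sum_{u}\sum_{v}\left[-(R_u R_v + S_u S_v)\sin(\Phi_u - \Phi_v) + (S_u R_v - R_u S_v)\cos(\Phi_u - \Phi_v)\right] \cdot \cos\theta_u \cos\theta_v \left[\cos(\theta_u - \alpha) - \cos(\theta_v - \alpha)\right]. \tag{81}$$

The only terms that exist in (79), (80), and (81) are those for which $u \ne v$. There are $N(N-1)/2$ different terms, which are all statistically independent, in (79), (80), and (81). Hence, by the central limit theorem, $\dot\psi_e$, $\dot\psi_{hx}$, and $\dot\psi_{hy}$ are Gaussian random variables.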

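The variances of $\dot\psi_e$, $\dot\psi_{hx}$, and $\dot\psi_{hy}$ are

$$\rho'_{11} = \langle \dot\psi_e^2\rangle = (\beta V)^2\, 4N(N-1) \quad \text{for } N \ge 3 \tag{82}$$

$$\rho'_{22} = \langle \dot\psi_{hx}^2\rangle = (\beta V)^2\,\frac{N(N-1)}{2}\left[\cos^2\alpha + 3\sin^2\alpha\right] \quad \text{for } N = 3,\ N \ge 5 \tag{83}$$

$$\rho'_{33} = \langle \dot\psi_{hy}^2\rangle = (\beta V)^2\,\frac{N(N-1)}{2}\left[3\cos^2\alpha + \sin^2\alpha\right] \quad \text{for } N = 3,\ N \ge 5. \tag{84}$$

The covariances of $\dot\psi_e$, $\dot\psi_{hx}$, and $\dot\psi_{hy}$ are

$$\rho'_{12} = \rho'_{21} = \langle \dot\psi_e \dot\psi_{hx}\rangle = -(\beta V)^2\, 2N(N-1)\sin^2\alpha \quad \text{for } N \ge 3 \tag{85}$$

$$\rho'_{13} = \rho'_{31} = \langle \dot\psi_e \dot\psi_{hy}\rangle = -(\beta V)^2\, 2N(N-1)\cos^2\alpha \quad \text{for } N \ge 3 \tag{86}$$

$$\rho'_{23} = \rho'_{32} = \langle \dot\psi_{hx} \dot\psi_{hy}\rangle = 0. \tag{87}$$

It is very easy to show from (60) to (66) and (79) to (81) that the covariances of the variables between the two groups $(\dot\psi_e, \dot\psi_{hx}, \dot\psi_{hy})$ and $(X_1, Y_1, X_2, Y_2, X_3, Y_3)$ are zero. We may write

$$\langle \dot\psi_m X_n\rangle = \langle \dot\psi_m Y_n\rangle = 0 \quad \text{for all } n \text{ and } m, \tag{88}$$

where $m = e, hx, hy$ and $n = 1, 2, 3$; hence $(\dot\psi_e, \dot\psi_{hx}, \dot\psi_{hy})$ and $(X_1, Y_1, X_2, Y_2, X_3, Y_3)$ are two independent variable groups.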


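A.2 Derivation of $M_{\dot\psi_t}(jv)$ in (50)


The mean values of all three Gaussian random variables $\dot\psi_e$, $\dot\psi_{hx}$, and $\dot\psi_{hy}$, as observed from (79) to (81), are zero. Also (87) gives the covariance $\langle \dot\psi_{hx} \dot\psi_{hy}\rangle = 0$. The joint probability density function of the three variables $\dot\psi_e$, $\dot\psi_{hx}$, $\dot\psi_{hy}$ can be obtained as

$$p(\dot\psi_e, \dot\psi_{hx}, \dot\psi_{hy}) = \frac{1}{(2\pi)^{3/2}|\Lambda|^{1/2}} \exp\left\{-\frac{1}{2|\Lambda|}\left[|\Lambda_{11}|\dot\psi_e^2 + |\Lambda_{22}|\dot\psi_{hx}^2 + |\Lambda_{33}|\dot\psi_{hy}^2 + 2|\Lambda_{12}|\dot\psi_e\dot\psi_{hx} + 2|\Lambda_{13}|\dot\psi_e\dot\psi_{hy} + 2|\Lambda_{23}|\dot\psi_{hx}\dot\psi_{hy}\right]\right\},$$

where $[\Lambda]$ is the covariance matrix, and $|\Lambda_{mn}|$ is the cofactor of $\rho'_{mn}$ in the covariance matrix $[\Lambda]$:

$$[\Lambda] = \begin{bmatrix} \rho'_{11} & \rho'_{12} & \rho'_{13} \\ \rho'_{12} & \rho'_{22} & 0 \\ \rho'_{13} & 0 & \rho'_{33} \end{bmatrix} \tag{89}$$

$$|\Lambda| = \text{determinant of } [\Lambda] = \rho'_{33}(\rho'_{11}\rho'_{22} - \rho'^2_{12}) - \rho'_{22}\rho'^2_{13}$$

$$|\Lambda_{11}| = \rho'_{22}\rho'_{33}$$
$$|\Lambda_{22}| = \rho'_{11}\rho'_{33} - \rho'^2_{13}$$
$$|\Lambda_{33}| = \rho'_{11}\rho'_{22} - \rho'^2_{12}$$
$$|\Lambda_{12}| = |\Lambda_{21}| = -\rho'_{12}\rho'_{33}$$
$$|\Lambda_{13}| = |\Lambda_{31}| = -\rho'_{22}\rho'_{13}$$
$$|\Lambda_{23}| = |\Lambda_{32}| = \rho'_{13}\rho'_{12}.$$

From (40), $\dot\psi_t$ is the sum of the three Gaussian random variables $\dot\psi_e$, $\dot\psi_{hx}$, $\dot\psi_{hy}$:

$$\dot\psi_t = \dot\psi_e + \dot\psi_{hx} + \dot\psi_{hy}.$$

The characteristic function for $\dot\psi_t$ is then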


M ;,(@) = Elexp {whe + Vie + Viv} ] 
= [ a [ exp {We + Whe + Vv) ln(. ’ Whe ’ Vindb d bred ry 


an 1 7 . ju( bet Wrz) 
~ GaP A PPI oe Joe” 


exp {~ yt ad sl 
ioWhy a | Ass | ( j 2 (| Ais We + | Acs Was) ; )} 
x [e exp { 9 | A | Why a2 | Ass | Why 
Api dyad, . (90) 
The last integrand of w, is 
exp i Aus Vv, + Aos | Whe v ae A33 | ( Ass ve + Ags vee) 


| Ass | 2) A| “Tas | 


es : : | A 13 | ¥ e | Ao Mi 
a - eee [ta LS, + [ta LY) ads 
2| A | ; 

hs (inf Ais | ve + | Acs | vas) 


| Ass | 
4 | Aas | ( Ais | ve + | Acs | tas)" 


a | A | | Ass | 
x [ exp (ie — hs ae 
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From Cramer,*” p. 99 we obtain 


oO h, 9 2 
| ex (ive — Be ‘Nag = fs x - 5, 


h, _ | Ass | 


| A | 


Then following the same techniques we find 


where 





My (Q) = (Qa) | A; Pei 2 (ink - ha eae 
° ; 2 42 ss ‘ Visio 
r exp a = he eae, - exp (rs — he elas, 
eee See 2 jes e L (ui st) } 
~ Qn) PA [F Vighohs LP L 2 Vy ee 24 » (91) 
where 


h, = LAse 


| A | 
B 
hy = —— 
[ A] | Ass | 
h Sede eee Ge, IPI 
© [ A[ | Ass | B 
Vv, = VU 
» = (1— +41), 
| Ass | 
7 | Ais | | Ass |\ A 
v3 = 1—- = v 
| Asy | | Ass | 
A =| Ayo || Ass | — [ Ais | | Aes | = —pte | A | 
B= | Moe | | Ass | — | Aes |? = pts | A | 
CS) Wail Age Se hag She (| 


BC — A® = | Ags | | A |’. 
The constant value outside the bracket of (91) is 


1 (Qn) PIaaeer 
Qn! [Al Viwkoks TA v BC & 








—w 
and the expression inside the bracket of (91) is


op ~4| 2 4 aa 


= exp |-3 LM 1 + UAnt— Aad 


2 | A33 | 
A? | 
|| As | ra | Ais | — (| Ass | = | Acs yAls 
i a eh Se 2 
1 BC — A? ‘ | 
= exp {—3(pl1 + poo + p33 + 2pte + 2p!,)v"} = exp —4pw’. 
Thus, 
M ,,(j~) = exp —dpn". (92) 
Some Simple Self-Synchronizing
Digital Data Scramblers 


By J. E. SAVAGE 


(Manuscript received July 28, 1966) 


Two types of self-synchronizing digital data scramblers and descramblers 
are introduced and examined. The descramblers recover synchronization 
quickly after the insertion or deletion of channel bits, and they are relatively 
insensitive to channel errors. The scramblers act to increase the period of 
periodic data sequences, and the periodic channel sequences produced have 
approximately half as many transitions in one period as there are bits in a 
period. These circuits find application in common carrier systems where
short-period data sequences produce high-level tones in the transmission
band and, as a consequence, interchannel interference. They also have
application when receiver clocks derive synchronization from transitions
in the channel signal. A number of variations and modifications of the
scramblers which affect their cost and size are considered.

The scramblers and descramblers are similar in construction and consist 
of linear sequential filters with either feed-forward or feedback paths, 
counters, storage elements and peripheral logic. The counters, storage
elements and peripheral logic monitor the channel sequence but react
infrequently so that the scramblers and descramblers behave principally as
linear sequential filters.


I. INTRODUCTION AND SUMMARY 


In this paper, we present two basic types of self-synchronizing digital 
data scramblers and descramblers. A scrambler is a digital machine 
which maps a data sequence into a channel sequence and, when the 
data sequence is periodic, into a periodic channel sequence with period 
which is many times the data period. When the source is periodic, the 
channel sequence produced by the scrambler also has many transitions. 

A simple scrambler and one which is often used is a machine which 
adds a maximal-length shift-register sequence’’” to the data signal. 
The scrambled data signal is then descrambled by the subtraction of
the same maximal-length sequence. While this procedure is simple and
easily implemented, it suffers from the serious disadvantage that the
insertion or deletion of bits in the channel signal results in a descrambled
sequence which is a garbled version of the data signal. The scramblers
presented in this paper have the self-synchronizing property* and
recover quickly from the insertion or deletion of channel bits.

There are two important applications for our scramblers. In common 
carrier systems small nonlinearities are present in modulators and 
demodulators which are used to frequency multiplex a bank of channels. 
Consequently, high-level tones in one channel may produce interference 
in other channels as a result of the nonlinearities in the mixing process. 
For this reason, systems engineers place limits on the levels of isolated
tones in a customer’s transmission band. Tones, in turn, are produced 
in digital data transmission systems by periodic data sequences and 
the limit on tone levels is then translated into a lower bound on the 
period of periodic channel sequences. Thus, our first application is to 
insure that any periodic source sequence is mapped by a scrambler 
onto a periodic channel sequence with sufficiently large period. 

The second application for our scramblers concerns the need for 
transitions in the amplitude of the channel signal so that receiver clocks 
can derive bit or frame synchronization from the channel sequence. 
Receiver clocks often derive synchronization by passing the received 
signal through a filter tuned to the spectral component corresponding 
to the basic baud length and then observing the zero crossings of the 
filter output. Since the amplitude of the filter output will decrease to 
the background noise level if no transitions occur in the amplitude of 
the received signal or if the density of transitions is small, it is clear that 
in this application it is desirable to guarantee many closely spaced 
transitions in the amplitude of channel sequence when the source is 
periodic. 

We introduce two basic types of self-synchronizing, digital data 
scramblers called multi-counter scramblers and single-counter scramblers 
and they are discussed in Sections IV and VI, respectively. Each 
scrambler consists of a “basic scrambler” and a ‘‘monitoring logic” 
which consists of additional storage elements, counters and incidental 
logic. We show in Section II that the “basic scrambler,” which is a 
linear sequential filter with feedback paths and tap polynomial $h(x)$,
responds to a periodic data sequence of period s by producing a periodic 


* R. D. Fracassi and T. Tammaru introduced the self-synchronizing descrambler 
in a special scrambler for which they have a patent pending.? 
channel sequence whose period is either $s$ or the least common multiple
of $s$ and $p^m - 1$, where $m$ is the number of stages in the basic scrambler
and $p$ is a prime greater than or equal to the number of elements in
the source alphabet. The basic scrambler responds in this way to periodic
inputs when its tap polynomial $h(x)$ is primitive over the modular
field of $p$ elements, $GF(p)$. The counters, logic, and storage elements
of the monitoring logic monitor the channel sequence and respond
whenever this sequence has, as its period, one of the known data periods.
The monitoring logic then reacts and changes the state of the basic
scrambler, forcing it to have the long-period output.

We show in Sections IV and V that a multi-counter scrambler exists 
for binary as well as non-binary sources and we find the smallest thresh- 
olds required on counters in the monitoring logic of this scrambler. 
The single-counter scrambler is considered in Sections VI and VII and 
because of analytical difficulties we are only able to show the existence 
of this scrambler when the source is binary ($p = 2$) and the source
periods are all prime to $2^m - 1$, where $m$ is the size of the basic scrambler.
Mixtures of the scramblers for binary sources are examined in Sec- 
tion VIII. 

In Section IX we show that the scrambler output, when the input 
is periodic, contains many closely spaced transitions and that there 
are half as many transitions in one period as there are digits in that 
period. In Section XI we perform representative calculations to de- 
termine the spectrum of the scrambler output and find when the source 
is periodic that the output spectrum has P times as many tones as the 
unscrambled spectrum and each tone has 1/Pth the energy, where P 
is the factor by which the source period is increased. 

The descramblers for each of the scramblers are discussed in the 
sections in which the scramblers are introduced and they are also 
discussed separately in Section X. In that section, we show that the 
descramblers recover synchronization rapidly after the insertion or 
deletion of channel digits and we observe that the principal effect of 
infrequent channel errors on the descramblers is to multiply the number 
of channel errors by w(h), where w(h) is the number of nonzero terms 
in the tap polynomial h(x). In Section X we also note that the monitoring 
logics at the scrambler and descrambler reach threshold infrequently 
when the source is random and at most once when the source is periodic 
so that the descrambler monitoring logic may be removed and the 
descrambler considerably simplified as long as thresholding in the 
monitoring logic occurs at a tolerably low rate. 

An example is given in Section XII of the application of the scramblers 
and descramblers and representative calculations are performed to
determine which scrambler configuration is least expensive. Section 
XIII closes with conclusions. 


II. BASIC SCRAMBLER AND DESCRAMBLER 


The shift register circuit shown in Fig. 1(a) is a linear sequential 
filter with feedback paths* and is an example of the scrambling circuit 
which is basic to the multi-counter scrambler and to the single-counter 
scrambler discussed in later sections. The linear sequential filter with 
feed-forward paths* shown in Fig. 1(b) is the complementary circuit 
to that shown in Fig. 1(a) and regenerates the data sequence from the 
channel sequence. We assume in these two examples that data is 
presented as a binary sequence, that addition is taken modulo 2 and 
that the storage elements provide one bit of delay. 

Examination of the circuits of Fig. 1 shows that they have the required
synchronization property, since the effect of a bit lost or added
in the line sequence is felt only as long as the values stored in the
descrambler disagree with those stored at the scrambler, which is five
bit intervals in our example.
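
The synchronization property described above can be illustrated with a
small simulation. The following is a minimal sketch, not the circuit of
Fig. 1 itself: the tap positions (stages 2 and 5, that is,
$h(x) = 1 + x^2 + x^5$) and the names `TAPS`, `M`, `scramble`, and
`descramble` are assumptions made for illustration only.

```python
# Assumed taps: stages 2 and 5 of a five-stage register (hypothetical choice).
TAPS = (2, 5)
M = 5


def scramble(data, state=None):
    """Feedback filter: line bit = data bit XOR tap sum of past line bits."""
    reg = list(state) if state is not None else [0] * M  # reg[0] = newest line bit
    line = []
    for d in data:
        tap_sum = 0
        for k in TAPS:
            tap_sum ^= reg[k - 1]
        l = d ^ tap_sum
        line.append(l)
        reg = [l] + reg[:-1]          # shift the new line bit into the register
    return line


def descramble(line, state=None):
    """Feed-forward filter: data bit = line bit XOR tap sum of past line bits."""
    reg = list(state) if state is not None else [0] * M
    data = []
    for l in line:
        tap_sum = 0
        for k in TAPS:
            tap_sum ^= reg[k - 1]
        data.append(l ^ tap_sum)
        reg = [l] + reg[:-1]          # the register holds received line bits only
    return data
```

Because the descrambler register contains only received line bits, deleting
one line bit disturbs at most the next five output bits in this sketch;
after that the output is again the data sequence, shifted by one position.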

A more general form for the basic scrambler when the data is assumed
to be a sequence of digits from the modular field of $p$ elements,
$GF(p) = \{0, 1, \ldots, p - 1\}$, where $p$ is prime, is shown in Fig. 2. Here, addition







Fig. 1—(a) A basic scrambler; (b) a basic descrambler. 


Fig. 2— General basic scrambler. 


is taken modulo $p$ and the outputs of the storage elements are multiplied
by the tap constants $\{c_1, c_2, \ldots, c_m\}$ drawn from $GF(p)$.
Here, multiplication is also taken modulo $p$. The tap constants must
be constrained in a particular way if our scramblers are to extend the
period of periodic sequences in the desired manner. Namely, the tap
polynomial $h(x)$ in the indeterminate $x$ given below


\[
h(x) = 1 - c_1 x - c_2 x^2 - \cdots - c_m x^m \tag{1}
\]


must be a primitive polynomial* over the field $GF(p)$. This condition
will guarantee that the sequence generated by the basic scrambler in
the absence of input will be either all zero or a maximal length sequence,
that is, a sequence which repeats but once every $p^m - 1$ digits. In
the example given in Fig. 1, the tap polynomial is primitive over the
binary field and it will generate a maximal length sequence of period
$2^5 - 1 = 31$. ($h(x)$ is a primitive polynomial of degree $m$ over the field
$GF(p)$ if it is irreducible, that is, has no factors except 1 and itself,
and if it divides $x^n - 1$ for $n = p^m - 1$ but does not divide it for any
smaller $n$.)
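
The divisibility condition in the definition above can be checked directly
for binary polynomials. The sketch below is an illustration under stated
assumptions: polynomials over $GF(2)$ are stored as integer bitmasks (bit
$k$ is the coefficient of $x^k$), and the helper names `poly_order` and
`is_primitive` are hypothetical. It relies on the fact that a degree-$m$
polynomial with nonzero constant term for which the order of $x$ is
$2^m - 1$ is necessarily irreducible.

```python
def poly_order(h: int) -> int:
    """Smallest n >= 1 such that h(x) divides x^n - 1 over GF(2).

    h is a bitmask with nonzero constant term and degree >= 1.
    """
    m = h.bit_length() - 1        # degree of h
    r = 1                         # r holds x^n mod h, starting at x^0
    n = 0
    while True:
        r <<= 1                   # multiply by x
        if (r >> m) & 1:          # degree reached m, so reduce modulo h
            r ^= h
        n += 1
        if r == 1:
            return n


def is_primitive(h: int) -> bool:
    """True when the order of x modulo h is 2^m - 1."""
    m = h.bit_length() - 1
    return poly_order(h) == (1 << m) - 1
```

For example, `0b100101` ($x^5 + x^2 + 1$) has order 31 and is primitive,
while `0b11111` ($x^4 + x^3 + x^2 + x + 1$) divides $x^5 - 1$ and is not.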


Theorem 1: The basic scrambler described above when excited by a periodic
sequence of period† $s$ will respond with a periodic line sequence which
has either period $s$ or a period which is the least common multiple (LCM)
of $s$ and $p^m - 1$ (LCM$(s, p^m - 1)$). The period with which the scrambler
responds is a function of the initial values stored in the scrambler storage
elements, that is, its initial state, and there is but one such state (for each
phase of the input sequence) for which the line sequence has period $s$.
For all other such initial states the line sequence has the larger period.
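
Theorem 1 can be checked by exhaustive enumeration for a small binary
case. The following sketch assumes a five-stage scrambler with taps at
stages 2 and 5 (an illustrative choice, not necessarily that of Fig. 1) and
measures the cycle length of the pair (input phase, register contents) for
every starting state; the names `line_period` and `period_census` are
hypothetical.

```python
from collections import Counter
from itertools import product

TAPS, M = (2, 5), 5               # assumed taps and register length


def line_period(period_in, start):
    """Cycle length of the driven scrambler from a given starting state."""
    s = len(period_in)
    reg, phase, steps = list(start), 0, 0
    while True:
        tap_sum = 0
        for k in TAPS:
            tap_sum ^= reg[k - 1]
        reg = [period_in[phase] ^ tap_sum] + reg[:-1]
        phase = (phase + 1) % s
        steps += 1
        # The cycle closes when both input phase and register repeat.
        if phase == 0 and reg == list(start):
            return steps


def period_census(period_in):
    """Count starting states by the period they produce."""
    return Counter(line_period(period_in, st)
                   for st in product((0, 1), repeat=M))
```

For the input 1, 1, 0 repeated (period 3), one starting state gives period
3 and the remaining 31 give period LCM(3, 31) = 93, as the theorem states.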


This theorem is basic to all later results. It states that for only one 
starting state will the basic scrambler respond with period s to a data 
* A nonprimitive polynomial may produce more than two output periods for an
input period (see Theorem 1).
† A sequence will be said to be of period $s$ if it has no smaller period.


454 THE BELL SYSTEM TECHNICAL JOURNAL, FEBRUARY 1967 


sequence of that period. Thus, our objective, which is to extend the 
period of periodic sequences, is equivalent to detecting whether data 
preceding a periodic sequence has left the scrambler in the critical 
state for that sequence. Two basic methods of detecting the presence 
of the critical starting state when sequences of different periods are 
expected in the data are given in later sections. 


III. PROOF OF THE BASIC SCRAMBLER THEOREM


Model a periodic input to the basic scrambler with a circulating 
register, as shown in Fig. 3 for an input of period 3. The initial state
of the circulating register will be the first period of the periodic sequence. 
We let the vector y represent the state of the new circuit. Thus, if 
the input has period s and the basic scrambler has m stages, then y 
has s + m components where the first s represent (in reverse order) 
the first period of the periodic input and the last m components rep- 
resent the values stored in the corresponding storage elements when 
the periodic sequence begins. For example, y = (101101001) if the 
basic scrambler has the stored values 01001 when the sequence 
1101, 1101, 1101, --- arrives. 

The circuit of Fig. 3 is linear since the next set of stored values 
is a linear combination of the preceding set. Thus, the state y’ following 
$y$ can be found by a matrix operation on $y$ by the matrix $T$ given below,*
that is, y’ = Ty where y and y’ are taken to be column vectors. 


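\[
T = \left[\begin{array}{ccc|ccccc}
0 & 0 & 1 & 0 & 0 & 0 & 0 & 0 \\
1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 \\ \hline
0 & 0 & 1 & c_1 & c_2 & c_3 & c_4 & c_5 \\
0 & 0 & 0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 1 & 0
\end{array}\right] \tag{2}
\]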

For the general basic scrambler and an input of arbitrary period, 


* For an excellent discussion of the matrix approach to linear sequential switching 
circuits see B. Elspas, The Theory of Autonomous Linear Sequential Networks, 
IRE Trans. Circuit Theory, 6, pp. 45-60, 1959, which is reprinted in Ref. 13. 


Fig. 3— Model of basic scrambler with periodic input. 


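say $s$, the matrix $T$ has the following form

\[
T = \left[\begin{array}{c|c}
R & 0 \\ \hline
\begin{matrix}
0 & \cdots & 0 & 1 \\
0 & \cdots & 0 & 0 \\
\vdots & & \vdots & \vdots \\
0 & \cdots & 0 & 0
\end{matrix} & T_h
\end{array}\right] \tag{3}
\]

where $R$ is $s \times s$ and is shown below

\[
R = \begin{bmatrix}
0 & 0 & \cdots & 0 & 1 \\
1 & 0 & \cdots & 0 & 0 \\
0 & 1 & \cdots & 0 & 0 \\
\vdots & & \ddots & & \vdots \\
0 & 0 & \cdots & 1 & 0
\end{bmatrix} \tag{4}
\]

$T_h$ is $m \times m$ and is given as

\[
T_h = \begin{bmatrix}
c_1 & c_2 & \cdots & c_{m-1} & c_m \\
1 & 0 & \cdots & 0 & 0 \\
0 & 1 & \cdots & 0 & 0 \\
\vdots & & \ddots & & \vdots \\
0 & 0 & \cdots & 1 & 0
\end{bmatrix} \tag{5}
\]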

Since the state $y'$ is found from $y' = Ty$, all succeeding states are
found by taking powers of $T$, that is, the $j$th state succeeding $y$ is

\[
y_j = T^j y. \tag{6}
\]

The line sequence generated by a periodic input to the basic scrambler
is periodic of the same period as the state $y_j$ of the circuit which models
the basic scrambler and periodic input. Thus, we prove Theorem 1
by studying the cycles of (6).
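
The matrix model can be compared against a direct simulation of the
circuit. The sketch below builds $T$ over $GF(2)$ from the block structure
described above and checks that the $(s + 1)$th component of $T^j y$
reproduces the line digits. The tap positions (2 and 5) are an assumption
for illustration, and the function names are hypothetical.

```python
def build_T(s, taps, m):
    """Transition matrix for input period s and an m-stage scrambler, over GF(2)."""
    n = s + m
    T = [[0] * n for _ in range(n)]
    T[0][s - 1] = 1                  # R: first row wraps the circulating register
    for r in range(1, s):
        T[r][r - 1] = 1              # R: remaining rows shift by one place
    T[s][s - 1] = 1                  # the solitary 1 coupling input to scrambler
    for k in taps:
        T[s][s + k - 1] = 1          # first row of T_h holds the tap constants
    for r in range(1, m):
        T[s + r][s + r - 1] = 1      # rest of T_h shifts by one place
    return T


def step(T, y):
    """Next state y' = T y, with arithmetic modulo 2."""
    return [sum(a * b for a, b in zip(row, y)) % 2 for row in T]


def line_from_matrix(T, y, s, count):
    """Line digits read from the (s + 1)th component of successive states."""
    out = []
    for _ in range(count):
        y = step(T, y)
        out.append(y[s])
    return out


def line_from_circuit(period_in, reg, taps, count):
    """Line digits from a direct simulation of the feedback shift register."""
    reg, out = list(reg), []
    for j in range(count):
        tap_sum = 0
        for k in taps:
            tap_sum ^= reg[k - 1]
        l = period_in[j % len(period_in)] ^ tap_sum
        out.append(l)
        reg = [l] + reg[:-1]
    return out
```

With the state $y = (101101001)$ of the example above (input 1101 repeated,
stored values 01001), both functions produce the same line sequence.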

There is an indirect approach” that one can take to study the cycles 
of (6). It amounts to a proof that these cycles are isomorphic to cycles
of a matrix $T^*$ obtained from $T$ by eliminating the solitary 1 shown
in (3). This amounts to disconnecting the circulating register from
the basic scrambler and observing that the basic scrambler, which is
a maximal length sequence generator, has period 1 or $p^m - 1$,
$m = \deg h(x)$, so that cycles of $T^*$ have period $s$ or LCM$(s, p^m - 1)$. The
proof that the register can be disconnected amounts to showing that
the minimal and characteristic polynomials of $T$ are the same and equal
those of $T^*$. Then, the elementary divisors of $T$ and $T^*$ are the same
and their cycles are isomorphic.

Since there is a direct proof of Theorem 1 which contains many
results important to the remainder of the paper, we present it here.
If the basic scrambler with periodic input starts with state $y$, then
it has a cycle of length $g$ if $T^g y = y$. The basic scrambler output will
then be periodic with period $g$. We now ask for those values of $g$ for
which $T^g y = y$ has a solution. We begin by writing


\[
y = y_s + y_m , \tag{7}
\]

where $y_s$ is such that its first $s$ components equal those of $y$ and its
last $m$ components are zero. The vector $y_m$ is zero in its first $s$
components and is equal to $y$ in its last $m$ components. We can interpret
$y_m$ as the "starting state" of the basic scrambler and $y_s$ as the state
of the model for the periodically driven basic scrambler when the
starting state is zero.

If

\[
T^g y = y \tag{8}
\]
then
\[
-T^g y_s + y_s = T^g y_m - y_m \tag{9}
\]

since $T$ is a linear operator. We assume that the periodic input is fixed
and has period which is strictly $s$. Then, the left-hand side of (9) is
fixed and we ask whether a solution $y_m$ for it exists for a given value
of $g$. We note that

\[
T^g = \begin{bmatrix} R^g & 0 \\ * & T_h^g \end{bmatrix} \tag{10}
\]

where the asterisk indicates some submatrix. Therefore, $T^g y_m - y_m$ is
a vector whose first $s$ components are zero. The left-hand side of (9)
has its first $s$ components zero only when $g$ is a multiple of $s$ because
in that case $R^g = I_s$, the $s \times s$ identity matrix, and otherwise
$R^g - I_s \ne 0$, which means that a cyclic shift of the first $s$ components of $y_s$
when added to $y_s$ is nonzero unless $g = ks$ for some integer $k \ge 1$.

If we use the notation $(y)'$ to indicate the last $m$ components of $y$
we have for (9) the following when $g = ks$

\[
(-T^{ks} y_s + y_s)' = (T_h^{ks} - I)(y_m)' \tag{11}
\]
where $I$ is now $m \times m$. We use the following theorem on (11).


Theorem 2: The matrix $T_h$ has characteristic polynomial $h(x)$ which is
assumed primitive over $GF(p)$. Therefore, $T_h^j - I$ is nonsingular for
$j = 1, 2, \ldots, p^m - 2$ and $T_h^n = I$ for $n = p^m - 1$.

Proof: See appendix. 
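
The appendix is not reproduced here, but the statement can be checked
numerically for a particular binary case. The sketch below stores each row
of a matrix over $GF(2)$ as an integer bitmask and tests both claims of
Theorem 2. The tap choices and the helper names `companion`, `matmul`,
`rank`, and `satisfies_theorem2` are assumptions for illustration.

```python
def companion(taps, m):
    """T_h over GF(2) as row bitmasks (bit c of a row is column c)."""
    rows = [sum(1 << (k - 1) for k in taps)]      # first row: tap constants
    rows += [1 << (r - 1) for r in range(1, m)]   # remaining rows: shift
    return rows


def matmul(A, B):
    """Product A B over GF(2): row a of A selects and XORs rows of B."""
    out = []
    for a in A:
        acc = 0
        for c, b in enumerate(B):
            if (a >> c) & 1:
                acc ^= b
        out.append(acc)
    return out


def rank(rows):
    """Rank over GF(2) of a square matrix, by Gaussian elimination."""
    rows, r = list(rows), 0
    for bit in range(len(rows)):
        piv = next((i for i in range(r, len(rows)) if (rows[i] >> bit) & 1), None)
        if piv is None:
            continue
        rows[r], rows[piv] = rows[piv], rows[r]
        for i in range(len(rows)):
            if i != r and (rows[i] >> bit) & 1:
                rows[i] ^= rows[r]
        r += 1
    return r


def satisfies_theorem2(taps, m):
    """Check T_h^j - I nonsingular for 1 <= j <= 2^m - 2 and T_h^(2^m - 1) = I."""
    T = companion(taps, m)
    I = [1 << i for i in range(m)]
    n = (1 << m) - 1
    P = T
    for _ in range(1, n):
        if rank([p ^ e for p, e in zip(P, I)]) != m:
            return False
        P = matmul(P, T)
    return P == I
```

With taps at stages 2 and 5 of a five-stage register the check succeeds;
with taps at all four stages of a four-stage register (a nonprimitive
characteristic polynomial) it fails, since the fifth power is already the
identity.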

Since $T_h^n = I$ for $n = p^m - 1$, we can reduce $ks$ modulo $n$ so that $T_h^{ks}$
can be written as a power of $T_h$ less than $n$. In particular for $k < k_0$,
where $k_0 s$ is the least common multiple of $s$ and $n$, which we call $e$,
that is,

\[
e = k_0 s = \mathrm{LCM}(s, p^m - 1), \tag{12}
\]
the matrices $T_h^{ks}$ can be written as $T_h^{j_k}$ where $0 < j_k < n$. We have
CST, Sym aad) S77, (13) 


Returning to (11) we see that $T_h^{ks} - I$ is nonsingular for $1 \le k < k_0$.
Therefore, when $k = 1$, (11) possesses a unique solution $y_m$. That is,
there exists a unique starting state $y_m$ for each periodic sequence
(modeled by $y_s$) having period strictly $s$ such that $T^s(y_m + y_s) = y_m + y_s$.
Similarly, there exists a unique solution to (11) for each $2 \le k < k_0$.
However, if $T^s y = y$, $y = y_m + y_s$, then $T^{ks} y = y$ so that the cycles
having period $ks$ are really repetitions of the single cycle having period $s$.
Also, when $k = k_0$, $T^{k_0 s} = I$ and $T^{k_0 s} y = y$ for all $y$. We conclude
that for a prescribed input having period strictly $s$, the basic scrambler
will respond with period $s$ for only one starting state $y_m$ and for all other
starting states will respond with period $e$ given by (12). This proves
Theorem 1.

We have finished our discussion of the basic scrambler. We now 
consider the techniques used to detect the presence of a periodic sequence 
of low period on the line and present the first of two methods for altering 
the starting state of the basic scrambler. This first method is more 
general than the second and allows for the simultaneous detection 
of sequences of several periods. The second method applies only when 
the sequences expected on the line have periods which divide one of 
two numbers. 
IV. THE MULTI-COUNTER SCRAMBLER


The general form of the multi-counter scrambler (MCS) is shown
in Fig. 4. (The descrambler is shown in Fig. 5.) There are $N$ counters,
one for each period $s_i$, $1 \le i \le N$, and the $i$th counter will generate
+1 if it reaches its threshold $t_{s_i}$. A counter is reset whenever the
reset lead is nonzero so that $t_{s_i}$ consecutive zeros on the reset lead
of the $i$th counter will cause it to reach its threshold. All counter outputs
are fed to the OR circuit shown so that a 1 is generated at the
exclusive OR and added to the "tap sum"* whenever a counter reaches
threshold. At the same time, all counters are automatically reset.










Fig. 4— Multi-counter scrambler. 








The input to the $i$th counter is the difference between the present
line digit and the digit transmitted $s_i$ clock intervals earlier. If the
line sequence has period $s_i$, then these two digits agree and the difference
is zero. Then, the $i$th counter will reach threshold, the tap
sum will be altered and the state of the basic scrambler changed.†
The line sequence will then be changed from period $s_i$ to period
LCM$(s_i, p^m - 1)$ where $p$ is the size of the modular field $GF(p)$ and
$m$ is the number of stages in the basic scrambler.

* We define the "tap sum" as the quantity added to the next data bit at the input
to the basic scrambler.

† If the starting state of the basic scrambler is critical for a sequence of period $s_i$,
then the state after $j$ clock intervals is critical for the $j$th cyclic shift of the input
sequence. Hence, a change in the tap sum will force the next state to be noncritical.
We observe, then, that the multi-counter scrambler for any choice
of thresholds $\{t_{s_i}, 1 \le i \le N\}$ will force the basic scrambler to switch
from a critical state to a noncritical state whenever the input has
period $s_1, s_2, \ldots,$ or $s_N$ or some period which divides an $s_i$. It should
be clear, however, that it is not necessary and perhaps not desirable
to change the tap sum and the next state of the basic scrambler when
the output does not have period $s_i$, $1 \le i \le N$, or some period which
divides an $s_i$. The next theorem specifies the minimum values of the
thresholds $t_{s_i}$, $1 \le i \le N$, so that the tap sum is changed only when
"necessary." (Note that random data may generate line sequences
which resemble periodic sequences and in such cases it will be "necessary"
to change the tap sum.)


Theorem 3 (MCS Theorem): The multi-counter scrambler shown in
Fig. 4 will scramble a periodic sequence of period $s$ if $s$ divides $s_i$ for
some $i$, $1 \le i \le N$, and will produce a periodic line sequence of period
LCM$(s, p^m - 1)$ if the following two conditions are met:


(i) The tap polynomial $h(x)$ of degree $m$ is primitive over $GF(p)$ where
data sequences have components from $GF(p)$.







Fig. 5— Multi-counter descrambler. 
(ii) The thresholds $t_{s_i}$, $1 \le i \le N$, are chosen as

\[
t_{s_i} = (m - 1) + \max_{\substack{1 \le j \le N \\ j \ne i}} s_j .
\]

If all input periods divide $s_0$, then the theorem holds when condition (i) is
met and a threshold of $t_{s_0} = m$ is used.

The descrambler for the MCS, shown in Fig. 5, has the self-synchronization
property as long as line errors do not occur. When line
errors occur, the counters in the descrambler may not read the same
levels as the corresponding counters in the MCS. However, as seen from
the MCS theorem, the counters must reset at least once every
$(m - 1) + \max s_j$ clock intervals when the input is periodic so that counter
synchronization is easily established in this case. With random data the
situation is not quite so clear. The question of descrambler synchronization,
including the effect of channel errors, is considered in detail in
Section X.
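
The operation of the multi-counter scrambler and its descrambler can be
sketched for a binary source as follows. This is an illustration under
stated assumptions rather than the circuits of Figs. 4 and 5: the taps
(stages 2 and 5), the detected periods (3 and 4), the exact clocking of the
counters, and the names `Monitor` and `run` are all hypothetical. The
thresholds follow the rule $(m - 1) + \max_{j \ne i} s_j$.

```python
TAPS, M = (2, 5), 5                  # assumed taps and register length
PERIODS = (3, 4)                     # illustrative periods s_i to be detected
THRESH = tuple((M - 1) + max(s for j, s in enumerate(PERIODS) if j != i)
               for i in range(len(PERIODS)))


class Monitor:
    """Counters that watch the line sequence (same logic at both ends)."""

    def __init__(self):
        self.hist = [0] * max(PERIODS)       # hist[0] = newest line bit
        self.count = [0] * len(PERIODS)
        self.fire = 0                        # 1 when a counter reached threshold

    def update(self, l):
        if self.fire:                        # all counters reset after firing
            self.count = [0] * len(PERIODS)
        self.fire = 0
        for i, s in enumerate(PERIODS):
            if l ^ self.hist[s - 1]:         # line bit differs from s bits ago
                self.count[i] = 0
            else:
                self.count[i] += 1
            if self.count[i] >= THRESH[i]:
                self.fire = 1
        self.hist = [l] + self.hist[:-1]


def run(bits, scrambling):
    """Scramble data bits, or descramble line bits, from an all-zero start."""
    reg, mon, out = [0] * M, Monitor(), []
    for b in bits:
        tap_sum = mon.fire                   # a firing counter alters the tap sum
        for k in TAPS:
            tap_sum ^= reg[k - 1]
        l = (b ^ tap_sum) if scrambling else b
        out.append(l if scrambling else (b ^ tap_sum))
        mon.update(l)
        reg = [l] + reg[:-1]
    return out
```

In this sketch an all-zero input applied to the all-zero (critical) state
produces seven zeros on the line, after which a counter fires and the line
carries a sequence of period 31. Since both ends derive the counter state
from the line sequence alone, `run(run(data, True), False)` returns `data`
when no line errors occur.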


V. PROOF OF THE MCS THEOREM 


We use the notation developed in Section III for the proof of Theorem 3.
Fig. 6 shows the basic scrambler with periodic input of period
s and one counter. The only input to the basic scrambler other than 
the data input is the lead from the counters which is used to change 
the tap sum. The counter shown is assigned to the detection of line 
sequences whose periods divide s; . 

To prove Theorem 3 we must show that $t_{s_i}$ can be chosen such that









Fig. 6—One counter of the MCS with periodic input. 
the $i$th counter does not reach threshold when the data sequence is
periodic unless the line sequence has period $s_i$ or some period which
divides $s_i$. Before we begin our proof we introduce some notation.
In Fig. 6, we use $l_j$ to indicate the $j$th line digit calculated with data
from a periodic input of period $s$. The basic scrambler is shown and we
indicate with the vector $y$ the state of the linear sequential filter composed
of the circulating register of $s$ stages and the $m$ stages of the
basic scrambler. Call this filter of $s + m$ stages the driven basic scrambler.
Then, from (6) the next state of the driven basic scrambler, $y'$, is


y’ = Ty (8) 


provided that the monitoring logic is not active. If it is active, that is,
if one or more counters reach threshold, then

\[
y' = Ty + y_1 \tag{14}
\]

where $y_1$ contains a single one in its $(s + 1)$th position.
The first line digit calculated with the periodic input, $l_1$, is

\[
l_1 = [Ty + u_1 y_1]_s , \tag{15}
\]
where
\[
[z]_s = z_{s+1} , \tag{16}
\]
the $(s + 1)$th component of the vector $z$, and
\[
u_1 = \begin{cases}
1 & \text{monitoring logic active at first calculation,} \\
0 & \text{otherwise.}
\end{cases} \tag{17}
\]
In general, the $j$th line digit is
\[
l_j = \Bigl[ T^j y + \sum_{k=1}^{j} u_k T^{j-k} y_1 \Bigr]_s \tag{18}
\]
where
\[
u_k = \begin{cases}
1 & \text{monitoring logic active at $k$th calculation,} \\
0 & \text{otherwise.}
\end{cases} \tag{19}
\]


Now consider the sequence $\{a_j\}$ calculated at point A of Fig. 6.
If a run of consecutive zeros in this sequence is large enough, the $i$th
counter will reach threshold unless some other counter reaches threshold
before it. When the periodic input begins, the counters in the MCS
will be at unknown levels and the $(\max s_j)$ stored values will be, in
general, unrelated to the input; thus one or more counters may reach
threshold before $l_1$ reaches the $s_i$th storage element of the MCS.* We
now wish to show that the sequence $\{a_j, j \ge s_i + 1\}$ will contain a
run of no more than $\max_j (m - 1 + s_j)$ zeros if the line sequence is periodic
with a period which does not divide $s_i$.

We have for $j \ge s_i + 1$

\[
a_j = -l_j + l_{j - s_i} \tag{20}
\]

and

\[
a_j = -\Bigl[ T^{j - s_i}\{T^{s_i} y - y\}
      + \sum_{k=1}^{j} u_k T^{j-k} y_1
      - \sum_{k=1}^{j - s_i} u_k T^{j - s_i - k} y_1 \Bigr]_s . \tag{21}
\]


If u,,;,, = 0 then let 7, be such that u; = 0,s; + 1 Sj <p andu,;, = 1, 
that is, the most recent thresholding occurs at 7 = jo. (The case u; = 0 
for all 7 = s; + 1 is trivial, so we assume that u;, = 1 for some 4) .) 
Then, if we write z; as 


To—si 


= Dur A 2d YP Nii (22) 

and if we ignore all counters except the zth, we have 
a, = (Pere ry — y} + 2)], (23) 
for jo S97 So + t,,; — 1. For this range of 7 the a; can be viewed as 


the values appearing in the (s + 1)th storage element of the driven basic 
scrambler with starting state y* 


yr = Perry — y} +2, . (24) 


Now, assume that input period $s$ does not divide $s_i$. Then, the first $s$
components of $T^{j_0 - s_i}\{T^{s_i} y - y\}$ are not all zero. Since $z_j$ is zero in
its first $s$ components, the starting state $y^*$ is nonzero in some of its
first $s$ components. Consequently, the state of the driven basic scrambler
(of $s + m$ stages) can never be completely zero† so that the sequence
$\{a_j, j \ge j_0\}$ cannot contain more than $s + m - 1$ consecutive zeros
if $s$ does not divide $s_i$.‡ We shall now show that, in fact, the sequence
$\{a_j, j \ge j_0\}$ cannot contain more than $s + m - 2$ consecutive zeros if $s \nmid s_i$.
We shall also show that there exists an input of period $s$ if $s_i = ks + 1$

* Note that the lead from the counters will be active at most once during the first
$s_i$ calculations if $s_i < \min_j t_{s_j}$.
† Note that the operator $T$ just circulates the first $s$ components of $y^*$.
‡ We will use the notation $s \nmid s_i$ to mean $s$ does not divide $s_i$.
for some integer $k \ge 1$ such that the sequence $\{a_j, j \ge j_0\}$ will have
this many consecutive zeros.

In (24) the first $s$ components of $y^*$ are a cyclic shift of the first $s$
components of $T^{s_i} y - y$. Recalling the definition of $T$ from (3) we see
that these $s$ components are the components of the vector $(R^{s_i} - I)(y_s)''$
where $R$ is $s \times s$ and is given by (4) and $(y_s)''$ is the vector containing
the first $s$ components of $y_s$. The vector $(R^{s_i} - I)(y_s)''$ cannot contain
a single nonzero element if $s \nmid s_i$ as seen below by example: Let $s = 4$,
$s_i = 6$ and $(y_s)''$ have components $y_1, y_2, y_3, y_4$. Then, we have

\[
(R^{s_i} - I)(y_s)'' =
\begin{bmatrix}
-1 & 0 & 1 & 0 \\
0 & -1 & 0 & 1 \\
1 & 0 & -1 & 0 \\
0 & 1 & 0 & -1
\end{bmatrix}
\begin{bmatrix} y_1 \\ y_2 \\ y_3 \\ y_4 \end{bmatrix}
=
\begin{bmatrix} b \\ 0 \\ 0 \\ 0 \end{bmatrix} \tag{25}
\]
where $b$ is the single nonzero element. Thus,
\[
\begin{aligned}
-y_1 + y_3 &= b \ne 0 \\
-y_2 + y_4 &= 0 \\
+y_1 - y_3 &= 0 \\
+y_2 - y_4 &= 0 .
\end{aligned} \tag{26}
\]

It is clear that two equations, the first and third, cannot both be satisfied.
This will be true regardless of the location of the single nonzero
element. Hence, there must be at least two nonzero elements in the first $s$
components of $y^*$. Consequently, $\{a_j, j \ge j_0\}$ cannot contain a run
of more than $s + m - 2$ consecutive zeros. If $y_s$ contains a single nonzero
element and if $s_i = ks + 1$ then $(R^{s_i} - I)(y_s)''$ will contain two
consecutive nonzero elements. Also, since $y_m$ is in general arbitrary,
it can be chosen so that the first $s + m - 2$ digits generated with $y^*$
as the starting state will be zero.
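
The example generalizes by a short argument, sketched here as a
supplementary step. The all-ones row vector $\mathbf{1}^T$ is notation
introduced only for this sketch and does not appear in the original.

```latex
% R^g is a permutation matrix for every g, so each of its columns sums to 1:
%   1^T R^g = 1^T .
% Hence, for any vector v with components in GF(p),
\[
\mathbf{1}^{T}\,(R^{g} - I)\,v
  \;=\; \mathbf{1}^{T} R^{g} v \;-\; \mathbf{1}^{T} v
  \;=\; 0 \pmod{p} .
\]
% The components of (R^g - I) v therefore sum to zero in GF(p).
% A vector whose only nonzero component is b would sum to b \ne 0,
% so (R^g - I) v is either zero or has at least two nonzero components.
```

This is consistent with the conflict between the first and third equations
of the example: adding all four equations gives $0 = b$.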

At this point, we have shown that a periodic input of period $s$, where
$s \mid s_j$, some $j \ne i$, but $s \nmid s_i$, will not cause the $i$th counter to reach
threshold more than once after the $s_i$th line digit is transmitted if
we choose $t_{s_i}$ to be

\[
t_{s_i} = (m - 1) + \max_{j \ne i} s_j . \tag{27}
\]

This is true since the sequence generated at point A of Fig. 6 will not
show more than $t_{s_i}$ consecutive zeros after the first time the monitoring
logic is active following the transmission of the $s_i$th line digit. We
also note that a threshold of the size given above may be necessary
if there exists an $s$ such that $s_i = ks + 1$, some $k \ge 1$, where $s \mid s_j$,
some $j$.

Consider next the case where $s \mid s_i$. If the line sequence is periodic
of period $s$ (see Theorem 1) at any time after the periodic sequence
begins, the sequence at point A will contain an indefinite number of
zeros so that the $i$th counter will definitely reach threshold (unless
$s \mid s_j$, some $j \ne i$, and $t_{s_j} < t_{s_i}$, in which case the $j$th counter may
reach threshold first). Since there is only one critical state for each
periodic sequence, the change in the tap sum resulting from the detection
of the period $s$ line sequence will cause the output to have period
LCM$(s, p^m - 1)$. In this case the vector $y^*$ in (24) cannot be entirely
zero (its first $s$ components are zero, however) because it would then
result in an all zero sequence at point A. Thus, the last $m$ components
of $y^*$ must contain at least a single nonzero component. But $[T^{j - j_0} y^*]_s$
(which generates $\{a_j, j \ge j_0\}$) then is just the output of a maximal
length sequence generator (see appendix) so that no more than $m - 1$
consecutive zeros will be seen at point A if $s \mid s_i$ and the output has period
LCM$(s, p^m - 1)$.

In conclusion, if $s \mid s_i$ but the output does not have period $s$, or if $s \nmid s_i$
but $s \mid s_j$ some $j \ne i$, then the $i$th counter will reach threshold at most once
after the transmission of the $s_i$th line digit if the $i$th threshold $t_{s_i}$ is chosen
as

\[
t_{s_i} = (m - 1) + \max_{j \ne i} s_j . \tag{28}
\]

Of course, the same is true for any threshold larger than $t_{s_i}$.


VI. THE SINGLE-COUNTER SCRAMBLER 


The single-counter scrambler (SCS) is shown in Fig. 7 (and the descrambler
is shown in Fig. 8). This scrambler is designed to scramble
periodic binary sequences whose periods divide either $s_1$ or $s_2$ or both.
It has a single counter and for some applications may be less costly
to build than the multi-counter scrambler. While we consider
the SCS when the input periods divide either $s_1$ or $s_2$ or both, one may
be able to design for the case of many more input periods.

The SCS has two circuits for detecting periodic sequences. If either
or both of the two detecting circuits produces 0 at any one time, one
cannot with a single measurement determine whether the line sequence
has period $s_1$ or $s_2$. On the contrary, if both circuits produce a nonzero
Fig. 7—Single counter scrambler.


output, it is clear that the line sequence does not have period s, or 8, 
and that the counter should be reset. A 2-input AND gate has a non- 
zero output only when both inputs are nonzero, consequently, we use 
it as input to a counter, as shown in Fig. 7. This counter will reach 
threshold after ¢ line transmissions if each of ¢ consecutive pairs of 
outputs of the detecting circuits contains one or more 0’s. | 
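
The counting rule just described can be sketched in a few lines. This is a minimal model under stated assumptions: `b` and `c` are the 0/1 outputs of the two detecting circuits, the counter is cleared whenever the AND of the pair is 1, and (an assumption of this sketch) the counter also clears itself after reaching threshold. The name `scs_counter_hits` is hypothetical.

```python
def scs_counter_hits(b, c, threshold):
    """Return the indices at which the single counter reaches threshold.

    b, c      -- equal-length 0/1 sequences from the two detecting circuits
    threshold -- number of consecutive pairs containing a 0 that triggers
    """
    count = 0
    hits = []
    for j, (bj, cj) in enumerate(zip(b, c)):
        if bj and cj:
            # Both detectors nonzero: neither period is present, so reset.
            count = 0
        else:
            count += 1
            if count >= threshold:
                hits.append(j)
                count = 0  # assumed: reaching threshold also resets the counter
    return hits


if __name__ == "__main__":
    b = [0, 1, 0, 0, 1, 1]
    c = [1, 0, 0, 1, 0, 1]
    print(scs_counter_hits(b, c, 3))
```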
The major design problem of the SCS is the choice of the counter
threshold. This is not an easy problem, unfortunately, and all that we
have been able to say about it is that counter thresholds do exist when
$p = 2$ (the source is binary) and the input periods are relatively prime
to $2^m - 1$, and then only to give a gross upper bound on the smallest
permissible threshold. The following theorem states what is known about
the threshold for the SCS.

Fig. 8—Single counter descrambler.


Theorem 4 (SCS Theorem): A single-counter scrambler which will
scramble all periodic binary sequences with periods which divide $s_1$ or
$s_2$ ($s_1 < s_2$, $s_1 \nmid s_2$) exists if


(i) the tap polynomial $h(x)$ of degree $m$ is primitive over GF(2),
(ii) $s_1$ and $s_2$ are relatively prime to $2^m - 1$, and
(iii) a counter threshold $t$, $t \ge s_2(2^m - 1) - 2^{m-1} + 2$, is chosen.


This theorem does not rule out the possibility that an SCS exists
when $s_1$ and $s_2$ are not both relatively prime to $2^m - 1$, nor does it
rule out an SCS for nonbinary data. It simply states that when conditions
(i) and (ii) are met, one can show that a counter with threshold
$t$, $t \ge s_2(2^m - 1) - 2^{m-1} + 2$, will not reach threshold when the
output of the basic scrambler has period $s \times (2^m - 1)$ where $s$
divides $s_1$ or $s_2$ or both. In fact, the bound on the threshold required
to prevent the counter from reaching threshold prematurely is many times
larger than necessary. In the example given in Section XII the bound is
more than 35 times too large.
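
The sketch below evaluates the threshold bound for given periods, reading the bound as $s_2(2^m - 1) - 2^{m-1} + 2$, which is the form the closing line of the proof in Section VII arrives at. The function name `scs_threshold_bound` and the sample values are assumptions for illustration; the check of condition (ii) uses the standard-library `math.gcd`, and the primitivity of the tap polynomial (condition (i)) is not checked here.

```python
from math import gcd


def scs_threshold_bound(s1, s2, m):
    """Upper bound on the smallest permissible SCS counter threshold.

    Checks s1 < s2 and condition (ii): s1 and s2 relatively prime to
    2**m - 1. Condition (i) on the tap polynomial is assumed to hold.
    """
    scale = 2 ** m - 1
    if not s1 < s2:
        raise ValueError("expected s1 < s2")
    if gcd(s1, scale) != 1 or gcd(s2, scale) != 1:
        raise ValueError("s1 and s2 must be relatively prime to 2**m - 1")
    return s2 * scale - 2 ** (m - 1) + 2


if __name__ == "__main__":
    # Illustrative values only: periods 7 and 8 with a 7-stage scrambler.
    print(scs_threshold_bound(7, 8, 7))
```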


VII. PROOF OF THE SCS THEOREM 


For the proof of Theorem 4, we recall the proof of Theorem 3. In
particular, it is instructive to review the discussion surrounding
equations (20) through (24). We recall that $a_j$ of (20) is the $j$th digit
calculated (after the arrival of the periodic data sequence) at the input
to the $i$th counter of Fig. 6. We argued that if the $i$th counter reaches
threshold on the $j_0$th calculation, $j_0 \ge s_i + 1$, then $a_j$ could be
calculated from

$$a_j = -[T^{j-j_0}y^*]_s \tag{29}$$

for $j \ge j_0$ and until the next time the $i$th counter reaches threshold.
Here $[y]_s$ indicates the $(s + 1)$st component of $y$ and $y^*$ is given by (24).
Thus, the sequence generated at point A of Fig. 6, namely
$a_{j_0}, a_{j_0+1}, \ldots$, can be viewed as generated by the basic scrambler
with periodic input and $y^*$ as starting state.

In Theorem 4 we assume that the data sequence is binary so that
the above equations apply if we interpret subtraction as addition since
they are equivalent on the binary field. In Fig. 7 the sequences $\{b_j\}$
and $\{c_j\}$ are generated at points B and C, respectively. We wish to
show that the largest run of consecutive zeros in the logical AND of
$\{b_j\}$ and $\{c_j\}$ after a certain transient period cannot exceed
$s_2(2^m - 1) - 2^{m-1} + 1$ when $s_1 < s_2$ and $s_1$ and $s_2$ are both
relatively prime to $2^m - 1$.

Consider the sequences $\{b_j, j \ge s_2 + 1\}$ and $\{c_j, j \ge s_2 + 1\}$.
Then, if the counter reaches threshold at* $j_1$, $j_1 \ge s_2 + 1$, these two
sequences will cause the counter to reach threshold only once more if the
line sequence has period $s$, $s \mid s_1$ or $s \mid s_2$ or both. If the line
sequence has period† $s \times (2^m - 1)$, then neither $\{b_j, j \ge j_1\}$ nor
$\{c_j, j \ge j_1\}$ can be all zero since this would imply that the line
sequence has period $s_1$ or $s_2$.

We now consider two cases, case I when $s$ divides both $s_1$ and $s_2$
and case II when $s$ divides $s_1$ but not $s_2$ or vice versa. From (24) it
is clear that in case I both $\{b_j, j \ge j_1\}$ and $\{c_j, j \ge j_1\}$ are
the outputs of basic scramblers with no input and with nonzero starting
states so that they repeat with period $2^m - 1$. In case II when
$s \mid s_1$, say, but not $s_2$, $\{b_j, j \ge j_1\}$ is the output of a basic
scrambler with nonzero starting state and has period $2^m - 1$ while
$\{c_j, j \ge j_1\}$ is the output of a driven basic scrambler with input
period $s$. (The input may not be strictly of period $s$, however, as we
shall see later.)

The logical AND of the sequences generated at points B and C
of Fig. 7 can be interpreted as the sequence generated by the normal
arithmetic multiplication of $b_j$ and $c_j$. Thus, the sequence at point D
of Fig. 7 has period $2^m - 1$ in case I and period $s \times (2^m - 1)$ in
case II. Let $B_n$ and $C_n$ be $n$-component vectors with $B_i = b_{j_1+i}$
and $C_i = c_{j_1+i}$. Then, at point D, the vectors $B_n$ and $C_n$ generate
the $n$-vector $D_n = B_n \cdot C_n$, where multiplication of $B_n$ and $C_n$
is term-by-term, i.e.,

$$D_n = (B_1C_1, B_2C_2, \ldots, B_nC_n). \tag{30}$$


Let $w(y_n)$ be the Hamming weight of the $n$-vector $y_n$, that is,
the number of 1's in $y_n$. Then we have

$$w(D_n) = w(B_n \cdot C_n) = \frac{w(B_n) + w(C_n) - w(B_n + C_n)}{2} \tag{31}$$

where addition is modulo 2. We now wish to use this last equation to
find a lower bound on the number of 1's in $D_n$. From this we can obtain
an upper bound on the number of consecutive 0's in $D_n$ and an upper
bound to the threshold required to prevent the counter of Fig. 7 from
making unnecessary changes in the tap sum.

* It may indeed reach threshold for $1 \le j_1 \le s_2$ but this does not
affect our analysis.
† $s_1$ and $s_2$ are relatively prime to $2^m - 1$ so that the line sequence
has period $s \times (2^m - 1)$ if $s \mid s_1$ or $s \mid s_2$ as seen from
Theorem 1.

In case I we let $n = 2^m - 1$, which is the period of the sequence
generated at point D. Thus, $D_n$ is one period of this sequence. $B_n$ and
$C_n$ are each one period of the output of the basic scrambler (which is a
maximal length sequence generator). Thus, $B_n$ can be obtained from
$C_n$ by a cyclic shift and they both have the same Hamming weight.
Then, we have the following result.


Lemma 1: If $n = 2^m - 1$ and $B_n$ and $C_n$ are periods of a maximal length
sequence with $B_n = C_n$, then $w(D_n) = w(C_n) = 2^{m-1}$. If $B_n \ne C_n$,
then $w(D_n) = w(C_n)/2 = 2^{m-2}$.


Proof: We need only show that $w(C_n) = 2^{m-1}$. From the comments
at the end of the appendix we have that the state of the autonomous
basic scrambler, as a binary $m$-tuple, ranges through all $2^m - 1$ nonzero
binary $m$-tuples. Since the first digit of each $m$-tuple is a line digit,
there will be exactly $2^{m-1}$ 1's in one period of the line sequence
generated by the autonomous basic scrambler. (Note that the scrambler does
not start with the zero state.) Q.E.D.
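
The two weights in Lemma 1 can be checked numerically on small maximal length sequences. The sketch below is an illustration under stated assumptions: it uses a generic linear recurrence $a_{n+m} = a_{n+1} + a_n$ (mod 2), which corresponds to the primitive polynomials $x^3 + x + 1$ and $x^4 + x + 1$, rather than the exact register layout of the basic scrambler; the names `m_sequence` and `and_weight` are hypothetical.

```python
def m_sequence(m, taps=(0, 1)):
    """One period of a[n+m] = XOR of a[n+t] for t in taps, started at 0...01.

    With taps (0, 1) this is the recurrence of x^m + x + 1, which is
    primitive for m = 3 and m = 4 (the only cases exercised below).
    """
    state = [0] * (m - 1) + [1]
    period = []
    for _ in range(2 ** m - 1):
        period.append(state[0])
        new_digit = 0
        for t in taps:
            new_digit ^= state[t]
        state = state[1:] + [new_digit]
    return period


def and_weight(seq, shift):
    """Hamming weight of the term-by-term product of seq and a cyclic shift."""
    shifted = seq[shift:] + seq[:shift]
    return sum(a & b for a, b in zip(seq, shifted))


if __name__ == "__main__":
    for m in (3, 4):
        seq = m_sequence(m)
        print(m, sum(seq), [and_weight(seq, k) for k in range(1, len(seq))])
```

For a zero shift the product is the sequence itself, so the weight is $2^{m-1}$; for every other shift the product has weight $2^{m-2}$, in line with the lemma.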


In case I, then, the number of consecutive zeros in the sequence
at D cannot exceed $2^m - 1 - 2^{m-2}$ and a threshold of $2^m - 2^{m-2}$ will
prevent unnecessary tap sum changes in this case.

Consider now case II where $\{b_j, j \ge j_1\}$ has period $2^m - 1$ and
$\{c_j, j \ge j_1\}$ is the output of a driven basic scrambler characterized by

$$c_j = [T^{j-j_1}y^*]_s, \tag{32}$$

where $y^*$ is an $(s + m)$-vector which from (24) has the form
yea Te r{Ty +y} +z (33) 


where $y$ is an arbitrary $(s + m)$-vector, except that its first $s$
components model a periodic sequence of strictly period $s$, and $z$ is zero
in its first $s$ components and arbitrary in its last $m$ components. The
first $s$ components of $T^{s_2}y + y$ cannot be all zero if $s \nmid s_2$. It
may model a periodic sequence of period $s_0$, however, where $s_0 < s$ and
$s_0 \mid s$. In particular, we may have $s_0 = 1$ in which case the first $s$
components of $y^*$ may be 1's and $\{c_j, j \ge j_1\}$ may have an output of
period 1 consisting of the all 1 sequence. If this is true
$D_n = B_n \cdot C_n = B_n$ and $D_n$ will have no more than $m - 1$
consecutive zeros. If $\{c_j, j \ge j_1\}$ has period $s_0 > 1$ it will contain
no fewer than one nonzero component in each period and no more than
$s_0 - 1$ nonzero components in each period.




Thus, if $n = s_0$, $s_0 > 1$, and $C_n$ represents one period in the output
of period $s_0$, we have

$$1 \le w(C_n) \le (s_0 - 1). \tag{34}$$


When $\{c_j, j \ge j_1\}$ has period $s_0 \times (2^m - 1)$ let
$n = s_0 \times (2^m - 1)$ in (31). We now show that for this case

$$s_0(2^{m-1} - 1) \le w(C_n) \le s_0 2^{m-1}. \tag{35}$$


The state of the driven basic scrambler with input of period $s_0$ can
be represented with an $(s_0 + m)$-vector. There are $s_0 \times 2^m$
admissible state vectors since the last $m$ components are arbitrary and the
first $s_0$ components must be a cyclic shift of the first $s_0$ components of
some other state vector. The last $m$ components of these $s_0 \times 2^m$
vectors range through each of the $2^m$ $m$-tuples $s_0$ times. Since the
$(s_0 + 1)$st component of each state vector is a line digit,
$w(C_n) \le s_0 \times 2^{m-1}$, which is the number of 1's shown in these
positions. We also have $w(C_n) \ge s_0 \times 2^{m-1} - s_0$ since the
components of $C_n$ are generated by only $s_0 \times (2^m - 1)$ of the
$s_0 \times 2^m$ admissible state vectors and the missing state vectors may
all contain 1 in the $(s_0 + 1)$st components.

Returning to (31) we see that the vector $B_n + C_n$ appears. It represents
the first $n$ components of $\{b_j + c_j, j \ge j_1\}$. This is the output
sequence of a driven basic scrambler driven with period $s_0$ and which has
as a starting state the state which produces $\{c_j, j \ge j_1\}$ and which is
modified by the addition in its last $m$ components of the starting state
of the autonomous basic scrambler which produces $\{b_j, j \ge j_1\}$. Since
this last state is arbitrary, $B_n + C_n$ can be expected to have period
$s_0$ or $s_0 \times (2^m - 1)$ and the bounds on the weight of $C_n$ for these
two periods apply to $B_n + C_n$.

We now combine our bounds with (31) to obtain a lower bound
to $w(D_n)$ for case II. Remember that $n = s_0(2^m - 1)$.


(i) Let $C_n$ have period $s_0$; then $B_n + C_n$ has period
$s_0 \times (2^m - 1)$ and

$$w(D_n) \ge \frac{s_0 2^{m-1} + (2^m - 1) - s_0 2^{m-1}}{2} = \frac{2^m - 1}{2}, \tag{36}$$

where $w(B_n) = s_0 \times 2^{m-1}$ from Lemma 1.


(ii) Let $C_n$ have period $s_0 \times (2^m - 1)$ and $B_n + C_n$ have period
$s_0$. Then

$$w(D_n) \ge \frac{s_0 2^{m-1} + s_0(2^{m-1} - 1) - (s_0 - 1)(2^m - 1)}{2} = \frac{2^m - 1}{2}. \tag{37}$$


(iii) Let $C_n$ and $B_n + C_n$ have period $s_0(2^m - 1)$. Then

$$w(D_n) \ge \frac{s_0 2^{m-1} + s_0(2^{m-1} - 1) - s_0 2^{m-1}}{2} \ge 2^{m-1} - 1. \tag{38}$$

Therefore, the number of 1's in the sequence $D_n$ of $n = s_0 \times (2^m - 1)$
components for case II must be at least $(2^m - 1)/2 - 1/2$ and the number
of consecutive zeros cannot exceed $s_0 \times (2^m - 1) - (2^m - 1)/2 + 1/2$.

Combining the results for cases I and II we find that the number
of consecutive zeros at point D of Fig. 7 when $s_1 < s_2$, $s_1 \nmid s_2$, and
the input has period $s$, $s \mid s_1$ or $s \mid s_2$ or both, will not exceed
$s_2(2^m - 1) - (2^m - 1)/2 + 1/2$ unless the line sequence has period $s$.
The threshold then need not be any larger than
$s_2(2^m - 1) - 2^{m-1} + 2$ to prevent unnecessary changes in the tap sum.
Q.E.D.


VIII. MIXTURES OF THE SCRAMBLERS 


The two types of scramblers given above are distinguished by the
structure of their monitoring logics. The MCS has one counter for each
of the input periods $s_1, s_2, \ldots, s_N$ and the SCS has a single counter
to detect the presence of one of two periods, $s_1$ or $s_2$. We have found
the smallest threshold required on each counter of the MCS so that
they change the tap sum only when necessary. Also, we have shown
the existence of a finite threshold on the single counter of the SCS
when the source is binary and input periods are relatively prime to
$2^m - 1$, where $m$ is the number of stages in the basic scrambler.

Since the monitoring logic for both counters acts to detect the pres- 
ence of periodic sequences of known periods in the line sequence, 
it should be clear that a monitoring logic containing a mixture of the 
MCS logic and the SCS logic may be used. We know of an SCS monitor- 
ing logic only when the source is binary, however, so that the mixture 
must be restricted to the binary source case. Thus, we may now con- 
sider a scrambler with a monitoring logic, a portion of which has counters 
detecting the presence of one of a pair of periods and another portion 
consisting of individual counters for single periods. The outputs of 
all counters are fed to an OR gate which in turn is added modulo 2 
to the tap sum. The output of the OR gate is also used to reset all 
counters. 


IX. TRANSITIONS IN A SCRAMBLED SEQUENCE 


The basic scramblers described above may have applications in
situations where bit framing at the receiver is derived from transitions
in the line signal. In this section we show that transitions occur
frequently in a scrambled periodic sequence and that in one period of a
scrambled sequence there are approximately half as many transitions
as there are digits. These results are shown when the source is binary
and the scrambler input periods are relatively prime to $2^m - 1$, where
$m$ is the size of the basic scrambler.

Let $l$ represent one period of the line sequence generated by the basic
scrambler when the input has period $s$. If the source is binary, if the
basic scrambler has $m$ stages and if $s$ is relatively prime to $2^m - 1$, then
$l$ is an $s(2^m - 1)$ component vector. If we assume that the binary line
sequence is converted into a line signal by the mapping $1 \to 1$,
$0 \to -1$, and if it is linearly modulated, then transitions in the channel
signal occur whenever transitions in the line sequence appear. Thus, we
should like to know the number of transitions in $l$ and the maximum
separation between transitions.


Theorem 5: The binary vector $l$ of length $s(2^m - 1)$ representing the
response of a binary scrambler to an input of period $s$, when $s$ and
$2^m - 1$ are relatively prime, has at least one transition every $s + m$
digits and has a total of $\mathrm{Tr}(l)$ transitions where

$$\frac{1}{2}\left(\frac{2^m - 2}{2^m - 1}\right) \le \frac{\mathrm{Tr}(l)}{s(2^m - 1)} \le \frac{1}{2}\left(\frac{2^m}{2^m - 1}\right). \tag{39}$$

We begin by showing that every set of s + m consecutive line digits 
must contain at least one transition. The scrambled sequence is the 
response of the basic scrambler of Fig. 2 to an input of period s. We 
note that if the basic scrambler is in the all zero state then the tap 
sum (which is added to the data bit) is zero. Similarly, if it is in the 
all 1 state the tap sum is zero because if not, $h(1) = 0$ and $h(x)$ is
divisible by $x - 1$ which is impossible since $h(x)$ is irreducible. Then,
if s + m consecutive outputs of the scrambler are identical, the last s 
of the (s + m) corresponding tap sums are zero so that s consecutive 
data bits must be identical. This cannot happen if the source is periodic 
with period greater than 1. When s = 1, the line sequence must have 
period 1 if s + m consecutive line digits are identical, which also cannot 
happen since the line sequence has period 2” — 1 in this case. 

We now bound $\mathrm{Tr}(l)$, the number of transitions in one period, $l$,
of the line sequence. We use the notation of Section V so that the $j$th
digit of $l$, namely $l_j$, is written

$$l_j = [T^j y]_s, \tag{40}$$

where $T$ is given by (3) through (5) and $y$ is the state of the driven
basic scrambler at the beginning of a period of the data sequence.
Let us now observe that a transition occurs in between two digits in $l$ if
they sum to 1 modulo 2. Thus, the number of 1's in $l + l'$ (where $l'$ is
one cyclic shift of $l$ and addition is term-by-term) is the number of
transitions in $l$. For example, if $l = 10110$, $l' = 01011$ and
$l + l' = 11101$ then the number of transitions in $l$, including the
implicit transition at the first digit, is the Hamming weight of $l + l'$.
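
The counting rule in the example above is easy to reproduce. The following minimal sketch forms one cyclic shift of a period, adds it term-by-term modulo 2, and takes the Hamming weight; the function names are illustrative assumptions.

```python
def cyclic_shift_sum(period):
    """Return l + l' (mod 2), where l' is l cyclically shifted by one place."""
    shifted = period[-1:] + period[:-1]
    return [a ^ b for a, b in zip(period, shifted)]


def count_transitions(period):
    """Number of transitions in one period, counting the wrap-around one."""
    return sum(cyclic_shift_sum(period))


if __name__ == "__main__":
    l = [1, 0, 1, 1, 0]
    print(cyclic_shift_sum(l), count_transitions(l))
```

Run on the example period 10110, the sum is 11101 and the transition count is 4.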

In the process of proving Theorem 4 we have shown (see (35)) that
the Hamming weight of one period of the output of the basic scrambler
when the input is binary of period $s_0$ and $s_0$ and $2^m - 1$ are relatively
prime lies between $s_0(2^{m-1} - 1)$ and $s_0 2^{m-1}$. Hence, if we can show
that $l + l'$ is one period of the output of the scrambler with input
period $s_0$, we will have established Theorem 5.

We note that

$$l_j + l_{j+1} = [T^{j+1}y + T^j y]_s \tag{41}$$

so that we now examine $T^{j+1}y + T^j y$. We have

$$T^{j+1}y + T^j y = T^j(T + I)y. \tag{42}$$


As in (7), let $y = y_s + y_m$, where $y_s$ is zero in its last $m$
components, $y_m$ is zero in its first $s$ components and they represent the
periodic input and starting state of the basic scrambler, respectively.
Then,

$$(T + I)y = y_s + y_s' + (0, y_1, 0) + y_m + Ty_m, \tag{43}$$

where $y_s'$ is a single cyclic shift of $y_s$ in its first $s$ places and
$(0, y_1, 0)$ is a vector with a single component $y_1$ in the $(s + 1)$st
position. If we use $(z)'$ to represent the last $m$ components of $z$, then

$$(y_m + Ty_m)' = (T_m + I_m)(y_m)'. \tag{44}$$

In the appendix it has been established that $T_m + I_m$ is a nonsingular
matrix. From this we deduce that the last $m$ components of $(T + I)y$
range over all $2^m$ $m$-tuples as $y_m$ ranges over all $m$-tuples.

Now consider $y_s + y_s'$, which represents the first $s$ components
of $(T + I)y$. While $y_s$ models one period of a data sequence with period
exactly $s$, $y_s + y_s'$ may model a sequence with period $s_0$, $s_0 \mid s$.
For example, let $y_s = (1001000)$; then $y_s + y_s' = (0101000)$ and its
first 4 components represent two periods of a period 2 sequence. Thus,
we must view $(T + I)y$ as the starting state of a driven basic scrambler
with input period $s_0$ where $s_0 \mid s$. We then ask if the sequence
generated by this state has period $s_0$ or $s_0(2^m - 1)$. Since $y$ is
noncritical, the sequence generated by $(T + I)y$ must have the larger
period because if $y$ were critical, $(T^j + I)y = 0$ for some $j$,
$1 \le j \le s_0(2^m - 1) - 1$, and $(T^j + I)(T + I)y = 0$ as well for some
$j$ in this range so that $(T + I)y$ is a critical state. But we have shown
in Theorem 1 that there is only one critical state for each periodic
input. In the last paragraph we have seen that there is a one-to-one
mapping between the last $m$ components of $y$ and the last $m$ components
of $(T + I)y$; hence if $y$ is noncritical, $(T + I)y$ is noncritical and the
line sequence generated by $(T + I)y$ has period $s_0(2^m - 1)$ where
$s_0 \mid s$.

The vector $l + l'$ contains $s/s_0$ periods of a sequence of period
$s_0(2^m - 1)$. Let $C_n$ represent one such period. Then from (35), the
number of 1's in $C_n$, $w(C_n)$, is bounded by

$$s_0(2^{m-1} - 1) \le w(C_n) \le s_0 2^{m-1}. \tag{35}$$

Then,

$$\mathrm{Tr}(l) = w(l + l')$$

and

$$s(2^{m-1} - 1) \le \mathrm{Tr}(l) \le s 2^{m-1} \tag{45}$$

which gives the desired result after division by $s(2^m - 1)$.
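
The bound (45) can be checked on a small case by brute force. The sketch below is an illustration under stated assumptions, not the circuit of Fig. 2: it models a binary scrambler as $l_n = d_n + l_{n-2} + l_{n-3}$ (mod 2), whose characteristic polynomial $x^3 + x + 1$ is primitive, drives it with the period-2 "dotting" input, and counts transitions over $s(2^m - 1) = 14$ digits for each of the eight starting states. All function names are hypothetical.

```python
from itertools import product


def scramble(data_period, taps, state, n):
    """Generate n line digits; state[k] holds the line digit k+1 steps back."""
    reg = list(state)
    line = []
    for i in range(n):
        tap_sum = 0
        for t in taps:
            tap_sum ^= reg[t - 1]
        digit = data_period[i % len(data_period)] ^ tap_sum
        line.append(digit)
        reg = [digit] + reg[:-1]  # shift the new line digit into the register
    return line


def transitions(seq):
    """Cyclic transition count: Hamming weight of seq plus its one-place shift."""
    return sum(a ^ b for a, b in zip(seq, seq[-1:] + seq[:-1]))


def theorem5_bounds(s, m):
    """Lower and upper limits on Tr(l) from (45)."""
    return s * (2 ** (m - 1) - 1), s * 2 ** (m - 1)


if __name__ == "__main__":
    s, m, taps = 2, 3, (2, 3)
    low, high = theorem5_bounds(s, m)
    for state in product([0, 1], repeat=m):
        tr = transitions(scramble([1, 0], taps, state, s * (2 ** m - 1)))
        print(state, tr, low <= tr <= high)
```

Seven of the eight starting states satisfy the bound; the remaining one is the critical state, for which the line sequence simply repeats with the input period and the theorem does not apply.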


X. THE SELF-SYNCHRONIZING DESCRAMBLERS 


In this section, we show that the descrambler for each of the scramblers 
given above has the self-synchronizing property, that it is relatively 
insensitive to channel errors and that in some applications it can be 
considerably simplified by removal of the monitoring logic. 

Each scrambler is of the form shown in Fig. 9. Each descrambler can
be represented as shown in Fig. 10. The output marked "data" in
Fig. 10 is indeed data if the scrambler and descrambler are both started
in the same state and no channel errors occur since (i) the line sequence
will then pass through both basic scramblers and (ii) the modulo $p$
sum of a data bit, tap sum, line bit, and monitoring logic output is
zero at both the scrambler and descrambler.

If there are no channel errors we would like to show that the
descrambler will synchronize itself should it ever lose synchronism. The
descrambler will be said to be out of synchronism with the scrambler
if either the values stored in the basic scrambler and the delay elements
differ from those stored in corresponding sections of the scrambler
or if the counters in the monitoring logic are not at the same levels as
those at the scrambler or both. It is clear that the $s_N$ stages (if the
largest expected period is $s_N$) of the basic scrambler in the descrambler
and delay section will be purged after $s_N$ clock intervals and replaced
with accurate information if there are no channel errors. Then, after $s_N$
clock intervals the monitoring logic at the scrambler and descrambler
both are fed the same information. The monitoring logics will then
reach synchronism when either (i) counters at the scrambler and
descrambler reach threshold together, in which case all counters are reset
simultaneously, or (ii) the last $s_N + 1$ digits of the line sequence are
found to be inconsistent with a periodic sequence of period
$s_1, s_2, \ldots$ or $s_N$ and the counters at the descrambler are reset
individually but in synchronism with those at the scrambler. When the data
sequence is periodic of period $s_1, s_2, \ldots$ or $s_N$ the $i$th counter of
the MCS is reset (following the transient interval associated with the
arrival of the periodic sequence) at least once every
$t_{ci} = m - 1 + \max_{j \ne i} s_j$ clock intervals. With the SCS the single
counter is reset at least once every $s_2(2^m - 1) - 2^{m-1} + 2$ clock
intervals when the input has period $s_1$ or $s_2$, $s_1 < s_2$. Should the
input sequence be random, the monitoring logics may be brought into
synchronism because one of the counters reaches threshold and all counters
are reset, which is unlikely, or because the counters are reset
individually in synchronism with the scrambler counters, which is very
probable and increases in probability very rapidly to one. (If the source
is binary with independent, equiprobable outputs, the $i$th counter of the
MCS descrambler is resynchronized in the second manner after $n$ clock
intervals with probability $1 - 2^{-n}$; similarly, the counter of the SCS
descrambler is resynchronized with probability $1 - (3/4)^n$.)

Fig. 9— Block diagram of the scrambler.

Fig. 10— Block diagram of the descrambler.

Channel errors can affect the process of resynchronization. However, 
if we assume that they are relatively few in number, say, occurring 
once in every 10° transmissions, there will be long intervals during 
which resynchronization can take place. Since the descrambler requires
at most $s_N + \max t_{ci}$ (which equals $2s_N + m - 1$ in the MCS
case when $s_i \le s_N$ and is at most $s_2 2^m - 2^{m-1} + 2$ in the SCS case)
clock intervals to resynchronize when the source is periodic,
resynchronization will not be a problem with periodic inputs if $m$ and $s_N$
(or $s_2$) are reasonable in size. When the source output is random and
is a sequence of independent, equiprobable binary digits, the average 
number of clock cycles required by the $i$th counter of the MCS
descrambler to resynchronize (in the second way described in the preceding
paragraph) is two so that the MCS descrambler will resynchronize
on the average in $s_N + 2$ clock intervals. The counter of the SCS
descrambler will require four clock intervals on the average to
resynchronize so that the SCS descrambler will be resynchronized on the
average in $s_N + 4$ clock intervals. Hence, we may conclude that
resynchronization in the presence of channel errors which are relatively
few in number will not be a problem when the source is random. In 
fact, it may be easier to resynchronize when the data is random than 
it is when the data is periodic. 
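
The random-source figures quoted above follow from treating each clock interval as an independent trial in which a counter is reset with a fixed probability: 1/2 for an MCS counter and 1/4 for the SCS counter (both detector outputs nonzero). The sketch below states that geometric model explicitly; the model and the function names are assumptions made for illustration.

```python
def resync_probability(n, reset_prob):
    """Probability that at least one reset occurs within n clock intervals."""
    return 1.0 - (1.0 - reset_prob) ** n


def mean_resync_intervals(reset_prob):
    """Mean number of clock intervals until the first reset (geometric law)."""
    return 1.0 / reset_prob


if __name__ == "__main__":
    MCS_RESET = 0.5   # assumed per-interval reset probability, MCS counter
    SCS_RESET = 0.25  # assumed per-interval reset probability, SCS counter
    for n in (1, 4, 8, 16):
        print(n, resync_probability(n, MCS_RESET), resync_probability(n, SCS_RESET))
    print(mean_resync_intervals(MCS_RESET), mean_resync_intervals(SCS_RESET))
```

Under this model the means are two and four clock intervals, matching the averages used in the paragraph above.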

Now assume that the scrambler and descrambler are operating in
synchronism and consider the effect of channel errors on the descrambler
output. If we neglect the monitoring logic for a moment, it will be
seen that an isolated channel error, as it passes through the basic
scrambler, will cause $w(h)$ output errors, where $w(h)$ is the number of
nonzero terms in the tap polynomial $h(x)$. The monitoring logic, however,
may fail to act when it should or act when it should not and thereby
introduce additional errors. If we consider the effect of a single channel
error on the monitoring logic, we see that this error has a direct effect
on the $i$th counter of the MCS on two occasions, when it enters the basic
scrambler and when it reaches the $s_i$th storage element. A single channel
error has a direct effect on the counter of the SCS three times, once
when it enters the basic scrambler and again when it enters the $s_1$th
and $s_2$th storage elements. When the channel error affects a counter
of the descrambler, it may cause it to reset when it should not, which
will not cause any harm if the counter is about to be reset before
reaching threshold, as is the case for the known periodic inputs or as
frequently happens with a random source. A channel error which
causes a counter to continue to count when it should reset may indeed
be harmful since it may result in its reaching threshold and introduce
an unnecessary change in the descrambler output. This event is
unlikely to happen for the known periodic source sequences since the
counters reset frequently, and the number of clock intervals between
a set of three normal counter resets is often less than a given counter
threshold. It is also unlikely that a channel error will eliminate a reset
and cause a counter to reach threshold when the source is random.
For example, when the source is a binary, equiprobable, independent
letter source the average separation between three resets on the MCS
counters is four clock intervals and is eight clock intervals on the SCS.
We may conclude then that channel errors have a small effect on the
monitoring logic and thus affect the descrambler primarily by producing
approximately $w(h)$ times as many output errors as channel errors.

The descrambler can be considerably simplified, the problem of
synchronization loss in the descrambler monitoring logic eliminated,
and the problem of output errors due to the monitoring logic solved,
all by the removal of the monitoring logic at the descrambler. This
is not the drastic solution that it might seem, for the monitoring logic
reacts infrequently on random data and at most twice on known
periodic inputs (if counter thresholds all are larger than the largest
expected input period). With a binary, independent, equiprobable
letter source, one or more of the $N$ counters of the MCS reaches
threshold in $n$ transmissions with a probability, $P_M(n)$, which is bounded by

$$P_M(n) \le \sum_{i=1}^{N} (n - t_i + 1)\,2^{-t_i}, \tag{46}$$

where $t_i$ is the threshold on the $i$th counter and $t_i \ge t_{ci}$. The single
counter of the SCS reaches threshold $t$ in $n$ transmissions with
probability $P_S(n)$ where

$$P_S(n) \le (n - t + 1)\left(\tfrac{3}{4}\right)^{t}. \tag{47}$$

Hence, if the thresholds are large enough so that $P_M(n)$ or $P_S(n)$ is
less than 0.1, say, when $n$ equals the average number of transmissions
between channel errors, then we may safely say that the monitoring
logic at the descrambler is not necessary on random data inputs.
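
To get a feel for these bounds, the sketch below evaluates them numerically. It assumes the union-bound readings $P_M(n) \le \sum_i (n - t_i + 1)2^{-t_i}$ for (46) and $P_S(n) \le (n - t + 1)(3/4)^t$ for (47); the printed equations are damaged in this copy, so these forms, like the function names, should be treated as assumptions. Each term is the number of places a run of $t$ non-resets can start, multiplied by the probability of such a run.

```python
def mcs_threshold_bound(n, thresholds):
    """Union bound on the chance that any MCS counter reaches threshold in n steps."""
    return sum(max(n - t + 1, 0) * 2.0 ** (-t) for t in thresholds)


def scs_threshold_bound(n, t):
    """Union bound on the chance that the SCS counter reaches threshold t in n steps."""
    return max(n - t + 1, 0) * 0.75 ** t


if __name__ == "__main__":
    # Hypothetical operating point: about 10**5 transmissions between errors.
    n = 10 ** 5
    print(mcs_threshold_bound(n, [14, 14, 14, 13]))
    print(scs_threshold_bound(n, 60))
```

Both expressions are upper bounds and can exceed one for small thresholds, in which case they carry no information.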

When the source is periodic of period $s$, however, one of the $p^m$
starting states* of the basic scrambler will result in a line sequence of
period $s$ which subsequently will require at least one and at most two
outputs from the monitoring logic. Thus, if the data preceding a periodic
input is random, the monitoring logic at the descrambler will with
probability $1/p^m$ change at least 1 digit in the descrambler output.
Hence, if a customer can tolerate such an error rate and if the thresholds
are large enough, the monitoring logic at the descrambler can be removed
and the descrambler will then simply consist of a basic scrambler.


XI. THE SPECTRUM OF THE SCRAMBLER OUTPUT 


In this section, we perform representative calculations to show the
effect of scrambling on the spectrum of a linearly modulated carrier.
Assume that the source is binary and that a binary sequence is
converted into a waveform by the mapping $0 \to -1$, $1 \to +1$. Let $T_0$ be
the time interval allotted to each binary digit and let $l(t)$ be the
waveform generated by the binary sequence $l$. Then, we have

$$l(t) \cdot b(t) = -(l + b)(t) \tag{48}$$

where addition is taken modulo 2 and multiplication is on the reals.
The autocorrelation function of a waveform $l(t)$ is defined as

$$R_l(\tau) = \lim_{T \to \infty} \frac{1}{T} \int_{-T/2}^{T/2} l(t)\,l(t + \tau)\,dt. \tag{49}$$

If $l$ is the output of the scrambler when the input is an equiprobable,
independent letter source, then $l$ is a sequence of independent,
equiprobable, binary digits. Then, we have

$$R_l(\tau) = \begin{cases} 1 - |\tau|/T_0, & |\tau| \le T_0, \\ 0, & |\tau| > T_0. \end{cases} \tag{50}$$


* $p$ is the input alphabet size and $m$ is the number of stages in the basic scrambler.

The power density spectrum of $l(t)$, which is the Fourier transform
of $R_l(\tau)$, is for the random binary source

$$S_l(f) = T_0 \left(\frac{\sin \pi f T_0}{\pi f T_0}\right)^2. \tag{51}$$

Now let the source be periodic and assume, as an example, that
it has period 8 and that the following sequence is one period of the
source output: 10110010. Then, if $l$ represents this sequence and if
it is transmitted without scrambling, we find using (48) that it has the
autocorrelation function, $R_l(\tau)$, of Fig. 11. The power density spectrum
of $l(t)$, $S_l(f)$, is given below and shown in Fig. 12.


sin rf7"y\" sin 27fT,\? | 1 j 

aie ar, ( nfT ~ ( InfTy ) | Zee lr 7 nal Oe) 
Here $\delta(\cdot)$ is the Dirac delta function. Thus, $S_l(f)$ contains isolated
tones spaced by $1/T_1$, $T_1 = 8T_0$, the period of the data sequence.
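
The sample values of the autocorrelation for the period 8 example can be computed directly from the product rule (48): with the mapping $0 \to -1$, $1 \to +1$, the product of the sequence with its $k$-place cyclic shift averages to (number of 0's minus number of 1's in the modulo 2 sum) divided by the period. The sketch below does this for 10110010; the function name is an illustrative assumption, and the values between integer shifts follow by linear interpolation.

```python
def periodic_autocorrelation(bits):
    """Samples R(k T_0), k = 0..n-1, for a periodic +/-1 waveform built from bits."""
    n = len(bits)
    samples = []
    for k in range(n):
        shifted = bits[k:] + bits[:k]          # k cyclic shifts of the period
        ones = sum(a ^ b for a, b in zip(bits, shifted))
        zeros = n - ones
        # Product of the +/-1 waveforms is +1 where the bits agree, -1 otherwise.
        samples.append((zeros - ones) / n)
    return samples


if __name__ == "__main__":
    print(periodic_autocorrelation([1, 0, 1, 1, 0, 0, 1, 0]))
```

For this sequence the samples are 1 at zero shift, $-1/2$ at shifts of one place in either direction, and 0 elsewhere within the period.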

If the periodic data source of period $s$ is now scrambled, the line
sequence has period $T_0(\mathrm{LCM}(s, 2^m - 1))$. Assume now, as an example,
that $s$ and $2^m - 1$ are relatively prime so that the line sequence has
period $PT_1$, $P = 2^m - 1$, the scale-up factor, and $T_1 = sT_0$, the
source period. Now let $l$ represent one period of the binary line sequence.
Then, if $l_k$ represents $k$ cyclic shifts of $l$ we have

$$R_l(kT_0) = \frac{1}{PT_1} \int_{-PT_1/2}^{PT_1/2} l(t)\,l_k(t)\,dt. \tag{53}$$

When $k = \pm 1, \pm 2, \ldots, \pm(P - 1)$, we have

$$R_l(kT_0) = -\frac{1}{sP}\bigl(\text{No. 1's in } (l + l_k) - \text{No. 0's in } (l + l_k)\bigr). \tag{54}$$


Since $R_l(\tau)$ is linear in $\tau$ for $(k - 1)T_0 \le \tau \le kT_0$ we need
only have $R_l(\tau)$ at $\tau = kT_0$, $k = 0, \pm 1, \pm 2, \ldots$. We note that
$R_l(kPT_1) = 1$, $k = 0, \pm 1, \pm 2, \ldots$. To further evaluate (54),
however, we must return to Section IX.

Fig. 11— Autocorrelation function of period 8 sequence.

Fig. 12—Spectrum of period 8 sequence.


We have seen in Section IX that $l + l_1$ represents several periods
in the output of a basic scrambler driven by an input of period $s_0$,
$s_0 \mid s$, and started with a noncritical state $y$. The proof of this
result amounted to showing that the operation $(T + I)$ on $y$ mapped the
last $m$ components of $y$ one-to-one onto the last $m$ components of
$(T + I)y$. Thus, if $y$ is critical so is $(T + I)y$ and since there is only
one critical state for each periodic input $(T + I)y$ is noncritical if $y$
is noncritical. We can show in a similar manner that $(T^k + I)y$ is
noncritical when $y$ is noncritical as long as $k$ is not a multiple of
$2^m - 1$. Thus, $l + l_k$, which is produced by the starting state
$(T^k + I)y$ when $y$ generates $l$, is the output of a basic scrambler with
input period $s_0$ and output period $s_0(2^m - 1)$ when $k$ is not a multiple
of $2^m - 1$ and $s_0 \mid s$. Then, invoking (35), we have

$$-1/P \le R_l(kT_0) \le 1/P, \quad k \text{ not a multiple of } P = 2^m - 1. \tag{55}$$


We note, however, that $R_l(kT_0)$ for such $k$ may not all be equal.

Next consider $l + l_k$ when $k$ is a multiple of $2^m - 1$. If $l + l_k$
represents an output which has period which divides $s$, then
$(T^s + I)(T^k + I)y = 0$. We now show that $(T^s + I)(T^k + I)y = 0$ for all
$y$ when $k$ is a multiple of $2^m - 1$. We observe that


_ {0 0 ; 
pe 2 hy aaa oe (56) 
and 
since $T_m^k = I$. Because $T^s + I$ and $T^k + I$ commute, we have

$$(T^s + I)(T^k + I) = (T^k + I)(T^s + I) = 0. \tag{58}$$


Thus, $l + l_k$ represents a scrambler output of period $s_0$, where
$s_0 \mid s$. It is clear then that the number of 1's in one period of
$l + l_k$ is greater than or equal to 1 and less than or equal to $s_0 - 1$.
Also, $l + l_k$ is the same sequence for all multiples of $P = 2^m - 1$ which
are not multiples of $sP$. Thus, for $k$ a multiple of $P$ which is not a
multiple of $sP$, we have from (54) that

$$-1 < -\left(\frac{s_0 - 2}{s_0}\right) \le R_l(kT_0) \le \left(\frac{s_0 - 2}{s_0}\right) < 1 \tag{59}$$

when $s_0 \ge 2$.

To calculate a representative spectrum of the scrambled data sequence,
we assume that $R_l(\tau)$ has the following form, where $u$, $2 \le u \le 2s$,
is a function of the scrambler input (the number of 1's in $l + l_k$, $k$ a
multiple of $P$, depends on the input):





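$$ R_x(kT_0) = \begin{cases} 1, & k = nsP,\ n = 0, \pm 1, \pm 2, \ldots, \\ 1 - \dfrac{u}{s}, & k = nP,\ n \ne 0, \pm s, \pm 2s, \ldots, \\ -\dfrac{1}{P}, & \text{all other } k. \end{cases} \tag{60} $$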


The power density spectrum S_x(f) then is

$$ S_x(f) = -\frac{1}{P}\,\delta(f) + T_0 \left( \frac{\sin \pi f T_0}{\pi f T_0} \right)^2 \left\{ \frac{u}{s}\,\frac{1}{sPT_0} \sum_{n=-\infty}^{\infty} \delta\!\left( f - \frac{n}{sPT_0} \right) + \left( 1 - \frac{u}{s} + \frac{1}{P} \right) \frac{1}{PT_0} \sum_{n=-\infty}^{\infty} \delta\!\left( f - \frac{n}{PT_0} \right) \right\} $$

When u is of the order of s we see that the second term in curly brackets
has amplitudes which are proportional to 1/P^2 and are thus much
smaller than terms in the first sum. We show R_x(τ) with u = s, R_x(T_0) = ε
in Fig. 13 and S_x(f) in Fig. 14. The assumption that u = s is equivalent
to the assumption that ℓ + ℓ_k contains an equal number of 1's and 0's
when k is a multiple of P.

We deduce from this discussion of spectra that the principal effect
of scrambling when the scrambled sequence is converted to a signal
waveform in the manner given above is to increase the number of tones
in a given bandwidth by a factor which is approximately P and to decrease
the level of each tone by approximately the same factor.

Fig. 13— Autocorrelation function of a scrambled periodic sequence.


XII. AN EXAMPLE 


We shall now consider an application for the scramblers and we shall 
compare the relative cost and effectiveness of the MCS and the SCS. 
We will report on a computer simulation directed at the determination 
of the smallest SCS counter threshold for our example. 

Assume that the source is binary and that it may occasionally contain
sequences of period 1 (there are two: the all 0 sequence and the all
1 sequence), period 2 (there is only one: the 1010 ... sequence, known
as dotting), period 7 or period 8. Assume also that a line sequence of
period less than 100 is undesirable from spectral considerations. Since
the least common multiple of 1 and 2^m - 1 is 2^m - 1, we will require
that 2^m - 1 > 100. The smallest value of m for which this is true
is m = 7 for which 2^m - 1 = 127, a prime. We next require a primitive,
degree 7 binary polynomial for the tap polynomial. The polynomial
h(x) = 1 + x^* + x^7 is one such. Given h(x) our basic scrambler
is fixed. We next observe that 1 divides 7 and 8 and that 2 divides 8
so that we may build a scrambler which detects two periods s_1 = 7
and s_2 = 8.
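The parameter choices in this paragraph can be checked mechanically. The following is a sketch under stated assumptions: the helper names are hypothetical, and the tap set (1, 7), i.e. h(x) = 1 + x + x^7, is an assumed stand-in for the degree 7 trinomial quoted above. Because 127 is prime, any irreducible degree 7 binary polynomial is primitive, so the free-running period of the register should be 127.

```python
from math import gcd

def smallest_register_length(min_period):
    """Smallest m with 2^m - 1 > min_period."""
    m = 1
    while 2**m - 1 <= min_period:
        m += 1
    return m

def is_prime(n):
    """Trial-division primality test, adequate for small n."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True

def free_running_period(taps, m):
    """Period of l_t = XOR of l_{t-i}, i in taps, from a nonzero start (m must be a tap)."""
    start = (0,) * (m - 1) + (1,)
    state, steps = start, 0
    while True:
        bit = 0
        for i in taps:
            bit ^= state[-i]
        state = state[1:] + (bit,)
        steps += 1
        if state == start:
            return steps

def lcm(a, b):
    return a * b // gcd(a, b)

m = smallest_register_length(100)
P = 2**m - 1
# Line-sequence period for each troublesome input period s.
line_periods = {s: lcm(s, P) for s in (1, 2, 7, 8)}
print(m, P, is_prime(P), free_running_period((1, 7), m), line_periods)
```

Every resulting line period exceeds 100, which is the design requirement stated above.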

We next consider whether the MCS or the SCS should be used
for our problem. We see immediately from Theorem 3 that the threshold
on the first counter of the MCS, t_{s_1}, must be at least 14. Similarly,
t_{s_2} of the MCS must be at least 13. Since a 4-stage binary counter will
count to 16, we see directly that 8 counter stages, 8 shift register stages,
3 OR gates and some peripheral logic will suffice to build an MCS
for our problem.

Fig. 14— Spectrum of a scrambled periodic sequence.

From Theorem 4 we see that the threshold on the SCS need not be
any larger than 954 or require more than 10 stages of a binary counter
since 2^10 = 1024 > 954. A computer simulation of the SCS, however,
shows that the bound of 954 is more than 34 times larger than the
smallest required threshold, which was found to be 28. The results
of this simulation are tabulated in Table I. The largest run of consecutive
zeros at counter input was found for all period 7 and period
8 sequences when the line sequences had periods 7·127 and 8·127,
respectively. In Table I we list the fraction of the 384 periodic sequences
which have the gap lengths (maximum run of zeros) shown. (We note
that it is only necessary to simulate the SCS with one starting state
of the basic scrambler when 2^m - 1 is prime since all 2^m - 1 noncritical
starting states appear as states of the basic scrambler. Note also that
we can neglect the first eight inputs to the counter following the argument
of the third paragraph of Section VII.)

The SCS will scramble our periodic inputs if we choose a counter 
threshold of 32 which can be realized with a 5-stage binary counter. 
It will also require eight shift register stages, an AND gate and periph- 
eral logic. 
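The counter arithmetic in the last few paragraphs can be reproduced from the counts listed in Table I. This is a sketch only: the dictionary is transcribed from Table I (reading its last gap length as 27), the function name is hypothetical, and taking the smallest workable threshold to be the longest observed gap plus one is an assumed reading of the simulation result.

```python
# Gap length -> (number of period-7 inputs, number of period-8 inputs), from Table I.
GAP_COUNTS = {
    13: (14, 0), 14: (28, 16), 15: (14, 16), 16: (0, 16), 17: (0, 16),
    18: (28, 60), 19: (14, 18), 21: (0, 56), 22: (2, 2), 24: (0, 24),
    25: (0, 16), 26: (14, 0), 27: (14, 16),
}

def counter_stages(threshold):
    """Smallest number of binary stages n with 2^n >= threshold."""
    return max(1, (threshold - 1).bit_length())

period7_total = sum(c7 for c7, _ in GAP_COUNTS.values())
period8_total = sum(c8 for _, c8 in GAP_COUNTS.values())
longest_gap = max(gap for gap, counts in GAP_COUNTS.items() if any(counts))
smallest_threshold = longest_gap + 1    # assumed: counter must outlast the longest gap

print(period7_total, period8_total, smallest_threshold)
print(counter_stages(smallest_threshold), counter_stages(32), counter_stages(954))
```

The totals match the 2^7 + 2^8 = 384 periodic sequences mentioned above, and a threshold of 32 fits the same 5-stage counter that a threshold of 28 would need.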

TABLE I—GAP LENGTHS FOR PERIODIC INPUTS

                  Period 7            Period 8
Gap length      No.      %          No.      %
    13          14     10.92         0      0
    14          28     21.84        16      6.25
    15          14     10.92        16      6.25
    16           0      0           16      6.25
    17           0      0           16      6.25
    18          28     21.84        60     23.41
    19          14     10.92        18      7.04
    21           0      0           56     21.85
    22           2      1.56         2      0.78
    24           0      0           24      9.38
    25           0      0           16      6.25
    26          14     10.92         0      0
    27          14     10.92        16      6.25

As far as random data is concerned, we see from (46) that the MCS
for our application reaches threshold at least once in n transmissions
with probability


P_M(n) ≤ (n - 15)(3.06)10-° (61)
and we see from (47) that the SCS reaches threshold with probability
P_S(n) ≤ (n - 31)10™. (62)


Thus, the MCS has a slight edge on the SCS when it comes to scrambling 
random data since it is desirable to keep the frequency of threshold 
crossings low. 

In sum, it is safe to say that the SCS has the edge for our problem 
primarily because it is simpler and less expensive. Also, we note that 
the addition of a single-counter stage will reduce Ps(n) to (n — 31)10™. 
The autocorrelation function of the scrambled data sequence will be 
like that of Fig. 13 with |ε| < 0.008.


XIII. CONCLUSIONS 


We have introduced two major classes of self-synchronizing, digital
data scramblers called multi-counter scramblers and single-counter
scramblers. We have shown that these scramblers and combinations
of the two will map a periodic sequence of period s into a periodic
sequence of period LCM(s, p^m - 1), where p is the size of the source
alphabet (the SCS results require that p = 2 and that s and 2^m - 1
be relatively prime), if the basic scrambler tap polynomial h(x) of
degree m is a primitive polynomial over GF(p). We have found the
smallest values for the counter thresholds in the MCS and have shown
the existence of finite thresholds for the successful operation of the SCS.

We have shown that there are many transitions in the scrambled 
sequence and that they are well distributed. We have shown that the 
descramblers possess the self-synchronizing property and we have con- 
sidered the effect of channel errors on the descrambling process. We 
have seen that the principal effect of infrequent channel errors (occurring 
at a rate of one in 10° transmissions, say) is to cause approximately 
w(h) times as many output errors, where w(h) is the number of nonzero
terms in h(x). Channel errors were shown to have a relatively small 
effect on the output of the descrambler monitoring logic. 

We have found the power density spectrum of the waveform generated
by the scrambler output for a representative case, namely, when
the source is binary and the scrambled sequence is mapped onto a
±1 sequence. We have seen that scrambling does not affect the spectrum
of the line signal when the source is random and that its principal
effect when the source is periodic is to introduce P times as many
tones each having 1/Pth as much energy where P is the factor by which
the source period is increased.

It has been shown that the counters in the scrambler and descrambler 
reach threshold infrequently when the source is random and at most 
once each time the source becomes periodic. Thus, it has been argued 
that the counters at the descrambler might be removed if the rate 
at which the counters at the scrambler reach threshold is less than 
the rate of occurrence of channel errors, and if the customer can tolerate 
occasional output errors when his data is periodic. 
APPENDIX


Proof of Theorem 2 


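Let T_h be the m × m matrix shown below where the coefficients c_1, c_2,
..., c_m are elements of the modular field GF(p) of p elements, p a prime.

$$ T_h = \begin{bmatrix} c_1 & c_2 & \cdots & c_{m-1} & c_m \\ 1 & 0 & \cdots & 0 & 0 \\ 0 & 1 & \cdots & 0 & 0 \\ \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & \cdots & 1 & 0 \end{bmatrix} \tag{63} $$

Let h(x) be the polynomial shown below in the indeterminate x where the
coefficients are those appearing in (63).

$$ h(x) = x^m - c_1 x^{m-1} - \cdots - c_m. \tag{64} $$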


Then, one can show by direct calculation that the characteristic polynomial
of T_h, φ(x), defined by

$$ \varphi(x) = \det (T_h - xI), \tag{65} $$

is related to h(x) by

$$ \varphi(x) = (-1)^m h(x). \tag{66} $$

The matrix T_h is called the "companion matrix" for the polynomial h(x).
We assume that h(x) is a primitive polynomial over the field GF(p).
A polynomial h(x) is primitive if

(i) h(x) is irreducible over GF(p), that is, if there is no polynomial
with coefficients in GF(p) which divides h(x) except 1 and h(x) itself,
and

(ii) h(x) of degree m divides x^n - 1 for n = p^m - 1 but for no smaller
integer n.
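Condition (ii) can be tested numerically by finding the smallest n for which x^n leaves remainder 1 on division by h(x). The sketch below is a minimal illustration, not a full primitivity test: it checks only condition (ii), it assumes h(x) is monic with a nonzero constant term, and the function name and the example polynomials are choices made here for illustration.

```python
def order_of_x(h, p):
    """Smallest n >= 1 with x^n = 1 (mod h(x)) over GF(p).

    h lists the coefficients of a monic polynomial, constant term first,
    and must have a nonzero constant term (otherwise no such n exists).
    """
    m = len(h) - 1
    one = [1] + [0] * (m - 1)
    r = one[:]                       # r holds x^n mod h(x), low order first
    n = 0
    while True:
        carry = r[-1]                # coefficient that would become x^m
        r = [0] + r[:-1]             # multiply by x
        r = [(ri - carry * hi) % p for ri, hi in zip(r, h[:m])]
        n += 1
        if r == one:
            return n

examples = {
    "x^3 + x + 1 over GF(2)": order_of_x([1, 1, 0, 1], 2),
    "x^4 + x^3 + x^2 + x + 1 over GF(2)": order_of_x([1, 1, 1, 1, 1], 2),
    "x^2 + x + 2 over GF(3)": order_of_x([2, 1, 1], 3),
    "x^2 + 1 over GF(3)": order_of_x([1, 0, 1], 3),
}
for name, n in examples.items():
    print(name, "-> smallest n =", n)
```

The first and third examples reach n = p^m - 1 and so satisfy condition (ii); the second and fourth are irreducible but reach 1 at a smaller n, so they are not primitive.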


If we replace the term c_m in h(x) given in (64) by the matrix c_m I,
where I is the m × m identity matrix and replace x by T_h, where
powers of T_h are defined as successive matrix products, then we have
the well-known Cayley-Hamilton theorem

$$ \varphi(T_h) = 0, \tag{67} $$

where φ(x) is the characteristic polynomial of T_h. Thus, a matrix T_h
satisfies its own characteristic polynomial. There is a smallest degree
monic polynomial (coefficient of the highest degree term is 1), called
the minimal polynomial, m(x), such that

$$ m(T_h) = 0. \tag{68} $$

Since h(x) is irreducible, we have

$$ m(x) = h(x). \tag{69} $$
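The statement that T_h satisfies h(x) can be checked directly for small cases. The following sketch builds the companion matrix of (63) and evaluates h(T_h) as defined in (64) with arithmetic modulo p; the function names and the coefficient choices are illustrative assumptions, not taken from the paper.

```python
def companion(c, p):
    """Companion matrix of (63): first row c_1..c_m, ones on the subdiagonal."""
    m = len(c)
    rows = [[ci % p for ci in c]]
    for i in range(m - 1):
        rows.append([1 if j == i else 0 for j in range(m)])
    return rows

def mat_mul(a, b, p):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) % p for j in range(n)]
            for i in range(n)]

def h_of_T(c, p):
    """Evaluate h(T_h) = T_h^m - c_1 T_h^(m-1) - ... - c_m I over GF(p)."""
    m = len(c)
    t = companion(c, p)
    identity = [[int(i == j) for j in range(m)] for i in range(m)]
    powers = [identity]                      # powers[k] = T_h^k
    for _ in range(m):
        powers.append(mat_mul(powers[-1], t, p))
    result = [row[:] for row in powers[m]]
    for idx, ck in enumerate(c, start=1):    # c_idx multiplies T_h^(m - idx)
        pk = powers[m - idx]
        result = [[(result[i][j] - ck * pk[i][j]) % p for j in range(m)]
                  for i in range(m)]
    return result

for coeffs, prime in [((0, 1, 1), 2), ((2, 1), 3), ((1, 4, 2, 3), 5)]:
    print(coeffs, prime, h_of_T(coeffs, prime))
```

Each case prints the zero matrix, as (66) and (67) require for any choice of coefficients.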


We now wish to prove the following theorem. 


Theorem 2: The matrices T_h^k - I are nonsingular for 1 ≤ k ≤ p^m - 2.

We first prove the following two lemmas.

Lemma A1: If 0 ≤ i, j ≤ p^m - 2, i ≠ j, then T_h^i ≠ T_h^j.

Proof: If T_h^i = T_h^j for the i, j given above and i < j then

$$ T_h^i (T_h^{j-i} - I) = 0 $$

implies

$$ T_h^{j-i} - I = 0 $$

since det T_h = φ(0) ≠ 0. (If φ(0) = 0 then φ(x) is divisible by x and
h(x) is not primitive.) Consider now the polynomial x^{j-i} - 1. Using
the Euclidean division algorithm we have

$$ x^{j-i} - 1 = h(x)q(x) + s(x) $$

for unique q(x) and s(x) and degree s(x) < degree h(x). Therefore,

$$ T_h^{j-i} - I = 0 = h(T_h)q(T_h) + s(T_h) $$

which implies that

$$ s(T_h) = 0. $$

But m(x) = h(x) is the minimal polynomial of T_h so that s(x) = 0.
Therefore, h(x) divides x^n - 1, n = j - i < p^m - 1. Contradiction.
Hence, T_h^i ≠ T_h^j, i ≠ j, 0 ≤ i, j ≤ p^m - 2. QED


Lemma A2: All nonzero polynomials in T_h with coefficients in GF(p)
and of degree m - 1 or less are nonsingular.


Proof: Let p(x) be a nonzero polynomial of degree m - 1 or less with
coefficients in GF(p). Then, using the Euclidean division algorithm, we
have that the greatest common divisor, d(x), of p(x) and h(x) is given by

$$ d(x) = a(x)p(x) + b(x)h(x), $$

where a(x) and b(x) are unique polynomials. Since h(x) has degree
m and is irreducible, d(x) = 1 and

$$ 1 = a(x)p(x) + b(x)h(x). $$

Taking these polynomials in T_h, we have

$$ I = a(T_h)p(T_h) + b(T_h)h(T_h) $$

or since h(T_h) = 0 we have

$$ I = a(T_h)p(T_h) = p(T_h)a(T_h), $$

where the latter equality follows since the polynomials a(x) and p(x)
commute. Thus, the polynomial p(T_h) of degree m - 1 or less with
coefficients over GF(p) in the matrix T_h has both a left inverse and
a right inverse and is nonsingular. QED


Proof of Theorem 2:
Since h(T_h) = 0 we have

$$ T_h^m = c_1 T_h^{m-1} + c_2 T_h^{m-2} + \cdots + c_m I. $$

Thus, every power of T_h, such as T_h^i, can be written as a polynomial
in T_h of degree m - 1 or less. Hence, T_h^i - T_h^j can be written as a
polynomial of degree m - 1 or less in T_h. From Lemma A1, T_h^i - T_h^j ≠ 0,
i ≠ j, 0 ≤ i, j ≤ p^m - 2, so that T_h^i - T_h^j as a polynomial in T_h of
degree m - 1 or less is nonzero. From Lemma A2, T_h^i - T_h^j is nonsingular
and it follows by choosing j = 0, i = k with 1 ≤ k ≤ p^m - 2 that
T_h^k - I is nonsingular. QED


Theorem 2 in effect says that if y is some arbitrary, nonzero column
vector of m components chosen from GF(p) then T_h^k y runs through all
p^m - 1 nonzero vectors as k ranges between 0 and p^m - 2. Thus,
the linear sequential filter with feedback paths described by T_h is a
maximal-length sequence generator. Elspas comments that these results
were noted by Zierler and Golomb.
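Theorem 2 and the remark above can be verified by brute force for small fields. This sketch is illustrative only: the function names are hypothetical, the coefficient sets are chosen here (two primitive cases and one irreducible but non-primitive case for contrast), and singularity is tested by Gaussian elimination modulo p.

```python
def companion(c, p):
    """Companion matrix of (63): first row c_1..c_m, ones on the subdiagonal."""
    m = len(c)
    rows = [[ci % p for ci in c]]
    for i in range(m - 1):
        rows.append([1 if j == i else 0 for j in range(m)])
    return rows

def mat_mul(a, b, p):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) % p for j in range(n)]
            for i in range(n)]

def is_nonsingular(a, p):
    """Gaussian elimination over GF(p), p prime."""
    a = [row[:] for row in a]
    n = len(a)
    for col in range(n):
        pivot = next((r for r in range(col, n) if a[r][col] % p), None)
        if pivot is None:
            return False
        a[col], a[pivot] = a[pivot], a[col]
        inv = pow(a[col][col], p - 2, p)     # inverse by Fermat's little theorem
        for r in range(col + 1, n):
            f = a[r][col] * inv % p
            a[r] = [(x - f * y) % p for x, y in zip(a[r], a[col])]
    return True

def check_theorem2(c, p):
    """Return (all T_h^k - I nonsingular for 1 <= k <= p^m - 2, orbit size of y)."""
    m = len(c)
    t = companion(c, p)
    identity = [[int(i == j) for j in range(m)] for i in range(m)]
    power = identity                          # equals T_h^k at the top of each pass
    y = [1] + [0] * (m - 1)
    seen, all_nonsingular = set(), True
    for k in range(p**m - 1):
        seen.add(tuple(sum(power[i][j] * y[j] for j in range(m)) % p
                       for i in range(m)))
        if k >= 1:
            diff = [[(power[i][j] - identity[i][j]) % p for j in range(m)]
                    for i in range(m)]
            all_nonsingular = all_nonsingular and is_nonsingular(diff, p)
        power = mat_mul(power, t, p)
    return all_nonsingular, len(seen)

print(check_theorem2((0, 1, 1), 2))      # h(x) = x^3 + x + 1 over GF(2)
print(check_theorem2((2, 1), 3))         # h(x) = x^2 + x + 2 over GF(3)
print(check_theorem2((1, 1, 1, 1), 2))   # h(x) = x^4 + x^3 + x^2 + x + 1, not primitive
```

For the two primitive cases the orbit of y covers all p^m - 1 nonzero vectors; in the non-primitive case T_h^5 = I, so the check fails and the orbit is short.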
A Camera Tube with a
Silicon Diode Array Target 


By M. H. CROWELL, T. M. BUCK, E. F. LABUDA, 
J. V. DALTON, and E. J. WALSH 


A variety of electronic cameras have been developed for television 
systems.’ Among these the vidicon® and the Plumbicon® have the in- 
herent advantages of high sensitivity, small size, and simple mechanical 
construction. The operating principles of the vidicon and the Plumbicon 
are quite similar since they both utilize a thin photoconductive layer 
to convert the optical image to a stored charge pattern which is periodi- 
cally scanned and erased by an electron beam. Erasing the charge pattern 
creates the video signal. However, there is a distinct difference in overall 
device performance since the photoconducting target in the Plumbicon 
(PbO) is deposited in a manner to form a single, large area, graded 
p-n junction, each layer having high resistivity. In the vidicon, the 
evaporated layers of Sb2S3 forming the target behave like a
semi-insulating photoconductor.

A new type of target consisting of an array of electrically isolated 
reverse-biased diodes, as first suggested by Reynolds,* later discussed 
by Heijne’ and more recently by Wendland’, has several valuable 
attributes. 


(i) The dark current and the light-induced current can be essentially
independent of target (reverse bias) voltage and the response
characteristic can have a gamma of unity as in the Plumbicon.

(ii) The time constant associated with the charge leakage of an
array of reverse-biased diodes can be very much larger than the
intrinsic (dielectric relaxation) time constant of the bulk material. This
implies that an infrared responsive camera operating at room temperature
can be realized.

(iii) The spectral response can cover a wide range including the
visible and consequently much greater and more uniform sensitivity
can be achieved than in the vidicon or Plumbicon.

(iv) The target performance is insensitive to electron beam bombardment
and is unaffected by intense light sources so that deleterious burn-in
does not occur.

(v) There is no image persistence due to photoconductive lag.

(vi) The assembled tube may be processed using standard vacuum
techniques including a high temperature bake.

(vii) The operating lifetime can be expected to exceed that of the
vidicon and Plumbicon by a considerable margin.


Fig. 1— Schematic drawing of the diode array. In practice the perimeter thickness
was made ~4 mils to obtain a self-supporting structure.


In this brief, experimental results obtained from targets consisting
of a 540 × 540 array of reverse-biased Si diodes are reported. The
substrate is 10 Ω-cm, n-type Si, is self-supporting and can be anti-reflection
coated. The p-type islands are formed by diffusing boron through 8μ
diameter holes in the SiO2 film, the center-to-center spacing between
holes being 20μ. This arrangement provides sufficient diode capacitance
(~1000 μμfd/cm^2) to integrate the diode photoresponse over the time
interval of 1/30 sec (one frame period in commercial television). Ohmic
contact to the array is obtained via the gold ring evaporated onto the
n+ region near the perimeter of the Si wafer chip.

In normal operation, the electron beam, the diameter of which is
larger than that of a single diode, periodically charges the p-type islands
down to cathode (ground) potential while the potential of the n-type
material is held at 5 to 10 volts. This potential difference can be
sustained for a normal television frame time so long as the dark current
is <5 × 10-** amps/diode. The SiO2 film, also charged down to cathode
potential by the beam, remains there and isolates the substrate from
the beam. The incident light associated with the image is absorbed in
the Si, creating hole-electron pairs. Since the thickness of a self-supporting
wafer is ≥10^-3 cm and the absorption coefficient of Si for visible
light is greater than 3000 cm^-1, most of the hole-electron pairs will be
generated near the incident surface; the minority carriers (holes) then
diffuse to the depletion region of the diodes, discharging the diodes by
an amount proportional to the light intensity. The recharging of the
diodes by the scanning beam creates the video signal.
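The claim that most carriers are generated near the incident surface follows from exponential attenuation of light in the wafer. The sketch below is a rough illustration under stated assumptions: it takes the absorption coefficient at its quoted lower bound of 3000 cm^-1, assumes a wafer thickness of 10^-3 cm, ignores reflection, and uses hypothetical names throughout. It also works out the side length of the array from the quoted 20 micron spacing.

```python
import math

ALPHA_PER_CM = 3000.0        # quoted lower bound on the absorption coefficient
WAFER_THICKNESS_CM = 1e-3    # assumed wafer thickness
DIODE_PITCH_CM = 20e-4       # 20 micron center-to-center spacing
DIODES_PER_SIDE = 540

def absorbed_fraction(alpha_per_cm, depth_cm):
    """Fraction of entering light absorbed within depth_cm (reflection ignored)."""
    return 1.0 - math.exp(-alpha_per_cm * depth_cm)

whole_wafer = absorbed_fraction(ALPHA_PER_CM, WAFER_THICKNESS_CM)
first_third = absorbed_fraction(ALPHA_PER_CM, WAFER_THICKNESS_CM / 3)
array_side_cm = DIODES_PER_SIDE * DIODE_PITCH_CM

print(f"absorbed in whole wafer: {whole_wafer:.3f}")
print(f"absorbed in first third: {first_third:.3f}")
print(f"array side: {array_side_cm:.2f} cm")
```

Under these assumptions roughly 95 percent of the entering light is absorbed within the wafer and about 63 percent within the third nearest the illuminated surface, so the holes must diffuse across most of the wafer to reach the diodes.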

An exact analytical evaluation of the performance of the diode array
shown in Fig. 1 is quite complicated. However, with a simpler model
in which the p-regions of the array are replaced by one large homogeneous
p-region with no lateral conductivity, it is possible to estimate
the loss in light sensitivity and resolution due to minority carrier
recombination and diffusion. An analysis of this simpler model indicates
that for a minority carrier lifetime of 10 μsec, a surface recombination
velocity of <10* cm/sec, and a wafer thickness of 10^-3 cm, the
collection efficiency (ratio of collected holes to generated holes) for
uniform illumination with visible light is ≈80 percent. Lateral diffusion
will degrade the spatial resolution. For example, if the spatial variation
in the visible light were sinusoidal with a period corresponding to
4 × 10^-3 cm or twice the center-to-center spacing of the diodes, the ac
signal would be reduced to ? of the dc signal.

Fig. 2 —Photograph obtained with the 540 x 540 diode array target. The subject
was a black and white transparency illuminated with a tungsten lamp.

The performance of a Si diode array is illustrated by Fig. 2. This
photograph was obtained from a Kintel closed circuit system with
commercial television standards. The usual vidicon camera tube was
replaced by a tube using a 540 × 540 diode array target. The defects
in the picture reflect a localized high dark current and can be partly
attributed to defects in the bulk crystal from which the array was
fabricated and to defects in the SiO2 film.

The measured spectral response of a camera tube with a diode array
target is given in Fig. 3 for two wafer thicknesses. In these measurements,
the whole diode array was illuminated with a uniform light
intensity and the light induced dc current (average video signal) in
the target lead was measured. Continuous laser transitions were used
to obtain the absolute response at several wavelengths. The actual
target collection efficiency was better than that indicated in the figure
since no anti-reflection coating was used and no reflection loss corrections
were made. For Si, the reflection coefficient varies from 30 percent in
the near infrared to 65 percent in the blue portion of the spectrum.
With a single layer anti-reflection coating, the reflection can be reduced
to a few percent. This implies that if such a coating had been used on
the experimental targets a maximum collection efficiency of 20 percent
would have been obtained. At this maximum, the sensitivity would
have been 0.16 μamps/μwatt. Because of its wider spectral response,
the camera tube with a diode array target was 25 times more sensitive
than an 8134 RCA vidicon for illumination with an incandescent lamp
at normal operating temperature. The measured gamma was unity.

Fig. 3— Spectral response without reflection loss corrections of experimental
diode array targets for two wafer thicknesses. The photograph shown in Fig. 2 was
obtained with the 1.2-mil target.

The observed dark current for the entire array was 5 X 107° amps 
for a reverse bias of 5 to 10 volts. This implies that the leakage current 
per diode was %2 X 107° amps. The resolution was not limited by 
leakage between diodes. 